Preprocess
The next step is to load a T5 tokenizer to process the text and summary fields:
from transformers import AutoTokenizer
# Load the tokenizer that matches the T5 checkpoint
checkpoint = "google-t5/t5-small"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
The preprocessing function you want to create needs to:
Prefix the input with a prompt so T5 knows this is a summarization task.
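As a minimal sketch of the prefixing step, the snippet below prepends a task prompt to each input document before tokenization. The helper name `add_prefix` and the sample documents are hypothetical, and the prefix string `"summarize: "` is assumed here as the prompt conventionally used with T5 for summarization.

```python
# Assumed task prompt; "summarize: " is the conventional T5 summarization prefix.
PREFIX = "summarize: "


def add_prefix(texts, prefix=PREFIX):
    """Hypothetical helper: return a new list with the prefix prepended to each document."""
    return [prefix + text for text in texts]


# Made-up sample documents, for illustration only.
docs = [
    "The committee met on Tuesday to discuss the budget.",
    "Rain is expected this weekend.",
]
prefixed = add_prefix(docs)
print(prefixed[0])
```

The prefixed strings could then be passed to the tokenizer loaded earlier in place of the raw documents.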