[ViltProcessor] wraps a BERT tokenizer and a ViLT image processor into a single convenient processor:
from transformers import ViltProcessor
processor = ViltProcessor.from_pretrained(model_checkpoint)
To preprocess the data, we need to encode the images and questions using the [ViltProcessor].
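As a rough illustration of that encoding step, the sketch below passes a batch of images and questions through the processor in one call. The helper names `encode_batch` and `demo`, the padding and truncation settings, the blank test image, and the `dandelin/vilt-b32-mlm` checkpoint used as a default are all assumptions made for this example, not something the text above prescribes.

```python
def encode_batch(processor, images, questions):
    """Encode paired images and questions with a ViltProcessor-style callable.

    `encode_batch` is a hypothetical helper for illustration. The padding and
    truncation settings are assumptions; adjust them to your setup.
    """
    if len(images) != len(questions):
        raise ValueError("images and questions must have the same length")
    return processor(
        images=images,
        text=questions,
        padding="max_length",   # pad every question to the tokenizer's max length
        truncation=True,        # cut questions that exceed that length
        return_tensors="pt",    # return PyTorch tensors
    )


def demo(model_checkpoint="dandelin/vilt-b32-mlm"):
    """Download a processor and encode one synthetic example (needs network access)."""
    # Imported here so that encode_batch can be reused without loading these libraries.
    from PIL import Image
    from transformers import ViltProcessor

    processor = ViltProcessor.from_pretrained(model_checkpoint)
    # A blank image stands in for a real dataset image.
    image = Image.new("RGB", (384, 384), color="white")
    encoding = encode_batch(processor, [image], ["What color is the image?"])
    for name, tensor in encoding.items():
        print(name, tuple(tensor.shape))
    return encoding
```

Calling `demo()` prints the name and shape of each tensor in the encoding. With a ViLT processor this should include the tokenized question (such as `input_ids` and `attention_mask`) alongside the image tensors (such as `pixel_values`).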