Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Internally, the processor first uses
[LayoutLMv2ImageProcessor] to apply OCR on the image to get a list of words and normalized
bounding boxes, as well to resize the image to a given size in order to get the image input.