Spaces:

Ahmadzei
/

RAG

Runtime error

added 3 more tables for large emb model

5fa1a76 over 1 year ago

326 Bytes

	encoding = processor(image, question, return_tensors="pt")
	print(encoding.keys())
	dict_keys(['input_ids', 'token_type_ids', 'attention_mask', 'bbox', 'image'])

	Use case 5: visual question answering (inference), apply_ocr=False
	For visual question answering tasks (such as DocVQA), you can provide a question to the processor.