metadata
license: mit
pipeline_tag: image-text-to-text
This model was introduced in the paper "Docopilot: Improving Multimodal Models for Document-Level Understanding".
Please refer to https://github.com/OpenGVLab/Docopilot for details.