mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text • Updated 11 days ago • 134k • • 1.13k
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published Feb 20 • 142