VLMs - a rivasmig Collection

VLMs

updated Apr 19

Task Vectors are Cross-Modal
Paper • 2410.22330 • Published Oct 29, 2024 • 11

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding
Paper • 2502.01341 • Published Feb 3 • 39

DASH: Detection and Assessment of Systematic Hallucinations of VLMs
Paper • 2503.23573 • Published Mar 30 • 13

Kimi-VL Technical Report
Paper • 2504.07491 • Published Apr 10 • 134

SmolVLM: Redefining small and efficient multimodal models
Paper • 2504.05299 • Published Apr 7 • 197

Video-R1: Reinforcing Video Reasoning in MLLMs
Paper • 2503.21776 • Published Mar 27 • 80