AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published Feb 3
DASH: Detection and Assessment of Systematic Hallucinations of VLMs Paper • 2503.23573 • Published 25 days ago
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 17 days ago