Mavors: Multi-granularity Video Representation for Multimodal Large Language Model • Paper 2504.10068 • Published 9 days ago • 30
MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models • Paper 2504.03641 • Published 18 days ago • 14
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment • Paper 2502.10391 • Published Feb 14 • 35