EmoBench-M: Benchmarking Emotional Intelligence for Multimodal Large Language Models Paper • 2502.04424 • Published Feb 6
VisNumBench: Evaluating Number Sense of Multimodal Large Language Models Paper • 2503.14939 • Published Mar 19