LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark Paper • 2504.13805 • Published 6 days ago • 10
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Paper • 2502.09621 • Published Feb 13 • 28