InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners Paper • 2504.14239 • Published 4 days ago • 11
AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis Paper • 2504.13157 • Published 5 days ago • 17
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published 4 days ago • 84
LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks Paper • 2412.15204 • Published Dec 19, 2024 • 38
Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT? Paper • 2504.11741 • Published 7 days ago • 1
Iterative Self-Training for Code Generation via Reinforced Re-Ranking Paper • 2504.09643 • Published 9 days ago • 34
Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning Paper • 2504.08672 • Published 11 days ago • 53
SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models Paper • 2504.11468 • Published 12 days ago • 26
WORLDMEM: Long-term Consistent World Simulation with Memory Paper • 2504.12369 • Published 6 days ago • 30
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs Paper • 2504.11536 • Published 7 days ago • 56
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories Paper • 2504.08942 • Published 11 days ago • 27