Learning Adaptive Parallel Reasoning with Language Models Paper ⢠2504.15466 ⢠Published 3 days ago ⢠36
World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models Paper ⢠2306.08685 ⢠Published Jun 14, 2023 ⢠1
Inversion-Free Image Editing with Natural Language Paper ⢠2312.04965 ⢠Published Dec 7, 2023 ⢠2
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL Paper ⢠2402.19446 ⢠Published Feb 29, 2024
DANLI: Deliberative Agent for Following Natural Language Instructions Paper ⢠2210.12485 ⢠Published Oct 22, 2022
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning Paper ⢠2405.10292 ⢠Published May 16, 2024 ⢠2
Training Software Engineering Agents and Verifiers with SWE-Gym Paper ⢠2412.21139 ⢠Published Dec 30, 2024 ⢠23
SWE-Gym/MoatlessTools-Agent-Verifier-Train-Data Viewer ⢠Updated Dec 23, 2024 ⢠2.16k ⢠42 ⢠1
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper ⢠2407.16741 ⢠Published Jul 23, 2024 ⢠72
Advancing LLM Reasoning Generalists with Preference Trees Paper ⢠2404.02078 ⢠Published Apr 2, 2024 ⢠47
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales Paper ⢠2405.20974 ⢠Published May 31, 2024