Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation Paper • 2503.24379 • Published 20 days ago • 74
QueryAgent: A Reliable and Efficient Reasoning Framework with Environmental Feedback-based Self-Correction Paper • 2403.11886 • Published Mar 18, 2024 • 1
Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments Paper • 2403.08593 • Published Mar 13, 2024 • 1
VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model Paper • 2502.18906 • Published Feb 26 • 12
ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation Paper • 2502.09411 • Published Feb 13 • 19
SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution Paper • 2501.05040 • Published Jan 9 • 15
Large Action Models: From Inception to Implementation Paper • 2412.10047 • Published Dec 13, 2024 • 35
Large Language Model-Brained GUI Agents: A Survey Paper • 2411.18279 • Published Nov 27, 2024 • 32 • 3