AMFT: Aligning LLM Reasoners by Meta-Learning the Optimal Imitation-Exploration Balance Paper • 2508.06944 • Published 13 days ago • 1
AMFT: Aligning LLM Reasoners by Meta-Learning the Optimal Imitation-Exploration Balance Paper • 2508.06944 • Published 13 days ago • 1
AMFT: Aligning LLM Reasoners by Meta-Learning the Optimal Imitation-Exploration Balance Paper • 2508.06944 • Published 13 days ago • 1 • 2
Mitigating Geospatial Knowledge Hallucination in Large Language Models: Benchmarking and Dynamic Factuality Aligning Paper • 2507.19586 • Published 28 days ago
Matrix-3D: Omnidirectional Explorable 3D World Generation Paper • 2508.08086 • Published 11 days ago • 67
MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh Paper • 2508.01242 • Published 20 days ago • 9
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published 14 days ago • 155
DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning Paper • 2508.05405 • Published 15 days ago • 61
Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation Paper • 2508.05635 • Published 15 days ago • 71
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published 15 days ago • 154
Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success Paper • 2508.04280 • Published 16 days ago • 34
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Paper • 2508.02193 • Published 18 days ago • 126
Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation Paper • 2508.03320 • Published 17 days ago • 59