ParrotRouter/Qwen3-4B-Thinking-2507-20250813-033307-1 Text Generation • 4B • Updated 8 days ago • 7 • 1
InternVL3.5 Collection This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 45 items • Updated about 18 hours ago • 72
AFM-Models Collection The models and training dataset of the paper: Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL • 12 items • Updated 23 days ago • 16