NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation • Paper • 2504.13055 • Published 6 days ago • 18
🚀 Active PRM • Collection • Efficient Process Reward Model Training via Active Learning • 4 items • Updated 8 days ago • 3
Understanding R1-Zero-Like Training: A Critical Perspective • Paper • 2503.20783 • Published 28 days ago • 45
Efficient Process Reward Model Training via Active Learning • Paper • 2504.10559 • Published 9 days ago • 13
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs • Paper • 2502.12982 • Published Feb 18 • 16
Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models • Paper • 2412.05939 • Published Dec 8, 2024 • 16
🔱 Sailor2 Language Models • Collection • Sailing in South-East Asia with Inclusive Multilingual LLMs • 34 items • Updated Feb 24 • 27