NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation • Paper • 2504.13055 • Published 3 days ago • 16
Understanding R1-Zero-Like Training: A Critical Perspective • Paper • 2503.20783 • Published 25 days ago • 43