Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning Paper • 2504.08672 • Published 10 days ago • 52
Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models Paper • 2504.07951 • Published 11 days ago • 26
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published 7 days ago • 232
MM-IFEngine: Towards Multimodal Instruction Following Paper • 2504.07957 • Published 11 days ago • 33
Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill? Paper • 2504.06514 • Published 13 days ago • 38
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Paper • 2504.05599 • Published 14 days ago • 79
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 14 days ago • 164
ZClip: Adaptive Spike Mitigation for LLM Pre-Training Paper • 2504.02507 • Published 18 days ago • 76
Understanding R1-Zero-Like Training: A Critical Perspective Paper • 2503.20783 • Published 26 days ago • 43
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 3 days ago • 151
PaperBench: Evaluating AI's Ability to Replicate AI Research Paper • 2504.01848 • Published 19 days ago • 35
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction Paper • 2504.01014 • Published 20 days ago • 62
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models Paper • 2502.06608 • Published Feb 10 • 40
Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources Paper • 2504.00595 • Published 20 days ago • 34