Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published 4 days ago • 79
mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text • Updated 14 days ago • 97k • • 1.14k
Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path? Paper • 2502.15657 • Published Feb 21 • 5
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper • 2412.05271 • Published Dec 6, 2024 • 155
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 988