Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper β’ 2503.16219 β’ Published Mar 20 β’ 47
Running on Zero 61 61 Splatt3R - Zero-shot Gaussian Splatting from Uncalibarated Image Pairs β° Generate 3D scenes from one or two images
Running on L4 1.82k 1.82k MagicQuill πͺΆ Edit and enhance images with custom color and edge modifications
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL Paper β’ 2503.07536 β’ Published Mar 10 β’ 85
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models Paper β’ 2503.10437 β’ Published Mar 13 β’ 32
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper β’ 2502.15007 β’ Published Feb 20 β’ 175
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper β’ 2502.14499 β’ Published Feb 20 β’ 192
ReLearn: Unlearning via Learning for Large Language Models Paper β’ 2502.11190 β’ Published Feb 16 β’ 29
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering? Paper β’ 2502.12115 β’ Published Feb 17 β’ 45
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper β’ 2502.08910 β’ Published Feb 13 β’ 149