Sleep-time Compute: Beyond Inference Scaling at Test-time Paper • 2504.13171 • Published 3 days ago • 13
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Paper • 2504.13161 • Published 3 days ago • 84
Robust and Fine-Grained Detection of AI Generated Texts Paper • 2504.11952 • Published 4 days ago • 10
WORLDMEM: Long-term Consistent World Simulation with Memory Paper • 2504.12369 • Published 4 days ago • 28
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling Paper • 2504.13169 • Published 3 days ago • 36
MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft Paper • 2504.08388 • Published 9 days ago • 38
Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models Paper • 2504.03624 • Published 16 days ago • 13
How new data permeates LLM knowledge and how to dilute it Paper • 2504.09522 • Published 7 days ago • 6
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search Paper • 2504.08066 • Published 10 days ago • 10
Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability Paper • 2504.08003 • Published 11 days ago • 45
S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models Paper • 2504.10368 • Published 6 days ago • 20
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning Paper • 2504.08837 • Published 10 days ago • 40
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper • 2504.08791 • Published 13 days ago • 116
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published 6 days ago • 228
Kimina Prover Preview Collection State-of-the-Art Models for Formal Mathematical Reasoning • 4 items • Updated 6 days ago • 26
JetMoE: Reaching Llama2 Performance with 0.1M Dollars Paper • 2404.07413 • Published Apr 11, 2024 • 39