Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving Paper • 2504.02605 • Published 19 days ago • 44 • 3
Inference-Time Scaling for Generalist Reward Modeling Paper • 2504.02495 • Published 19 days ago • 53
Scaling Analysis of Interleaved Speech-Text Language Models Paper • 2504.02398 • Published 19 days ago • 27
AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation Paper • 2503.19693 • Published 28 days ago • 75
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14 • 97
OmnimatteZero: Training-free Real-time Omnimatte with Pre-trained Video Diffusion Models Paper • 2503.18033 • Published about 1 month ago • 24
A Survey on Large Language Model based Autonomous Agents Paper • 2308.11432 • Published Aug 22, 2023 • 3
More Documents, Same Length: Isolating the Challenge of Multiple Documents in RAG Paper • 2503.04388 • Published Mar 6 • 16