ReTool: Reinforcement Learning for Strategic Tool Use in LLMs Paper • 2504.11536 • Published 3 days ago • 33
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations Paper • 2504.10481 • Published 4 days ago • 73
RealHarm: A Collection of Real-World Language Model Application Failures Paper • 2504.10277 • Published 4 days ago • 10
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper • 2504.08791 • Published 11 days ago • 110
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper • 2504.08685 • Published 7 days ago • 113
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis Paper • 2504.04842 • Published 11 days ago • 30
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 10 days ago • 143
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving Paper • 2504.02605 • Published 15 days ago • 43
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published 18 days ago • 242
SANA-Sprint Collection SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation • 6 items • Updated 1 day ago • 35
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 8 items • Updated 15 days ago • 117
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes Paper • 2503.23461 • Published 19 days ago • 93
Transformers Use Causal World Models in Maze-Solving Tasks Paper • 2412.11867 • Published Dec 16, 2024 • 1
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy Paper • 2503.19757 • Published 24 days ago • 50