Submitted by wenyi 228 GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning · 77 authors 1.48k 4
Submitted by yuexiang96 73 Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning · 9 authors 86 2
Submitted by yilunzhao 45 SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks · 18 authors 47 2
Submitted by Lmxyy 41 Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation · 14 authors 483 3
Submitted by Haon-Chen 37 MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings · 7 authors 53 1
Submitted by Sansa 29 DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation · 7 authors 709 3
Submitted by RanjanSapkota 22 Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive Foundations for Artificial General Intelligence and its Societal Impact · 20 authors 4
Submitted by fushh7 15 HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context · 10 authors 120 1
Submitted by Amar-S 12 Training for X-Ray Vision: Amodal Segmentation, Amodal Content Completion, and View-Invariant Object Representation from Multi-Camera Video · 5 authors 1
Submitted by puar-playground 10 MusiXQA: Advancing Visual Music Understanding in Multimodal Large Language Models · 9 authors 2 1
Submitted by Simase 7 FreeLong++: Training-Free Long Video Generation via Multi-band SpectralFusion · 2 authors 1
Submitted by amanchadha 6 Peccavi: Visual Paraphrase Attack Safe and Distortion Free Image Watermarking Technique for AI-Generated Images · 7 authors 1
Submitted by AdinaY 5 IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering · 10 authors 39 1
Submitted by huxueyu 3 Mixture of Reasonings: Teach Large Language Models to Reason with Adaptive Strategies · 4 authors 1
Submitted by AmirHossein-razlighi 3 Confident Splatting: Confidence-Based Compression of 3D Gaussian Splatting via Learnable Beta Distributions · 3 authors 1
Submitted by Peter2023HuggingFace 2 FreNBRDF: A Frequency-Rectified Neural Material Representation · 3 authors 0 1