SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging Paper • 2504.10642 • Published 8 days ago • 1
RF-DETR Object Detection vs YOLOv12: A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity Paper • 2504.13099 • Published 5 days ago • 3
RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search Paper • 2504.15047 • Published 1 day ago • 4
LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark Paper • 2504.13805 • Published 4 days ago • 8
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation Paper • 2504.14899 • Published 1 day ago • 10
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners Paper • 2504.14239 • Published 4 days ago • 11
EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models Paper • 2504.15133 • Published 1 day ago • 15
LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs Paper • 2504.14655 • Published 2 days ago • 13
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs Paper • 2504.15280 • Published 1 day ago • 13
X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents Paper • 2504.13203 • Published 7 days ago • 24
StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians Paper • 2504.15281 • Published 1 day ago • 20
FlowReasoner: Reinforcing Query-Level Meta-Agents Paper • 2504.15257 • Published 1 day ago • 35
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models Paper • 2504.15271 • Published 1 day ago • 48
Generative AI Act II: Test Time Scaling Drives Cognition Engineering Paper • 2504.13828 • Published 4 days ago • 17
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations Paper • 2504.13816 • Published 4 days ago • 14
Thought Manipulation: External Thought Can Be Efficient for Large Reasoning Models Paper • 2504.13626 • Published 5 days ago • 7
It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization Paper • 2504.13173 • Published 5 days ago • 16
AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis Paper • 2504.13157 • Published 5 days ago • 17