PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines Paper • 2504.14738 • Published 3 days ago • 3
NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning Paper • 2504.13941 • Published 8 days ago • 6
RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search Paper • 2504.15047 • Published 3 days ago • 6
DRAGON: Distributional Rewards Optimize Diffusion Generative Models Paper • 2504.15217 • Published 2 days ago • 10
LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark Paper • 2504.13805 • Published 5 days ago • 10
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners Paper • 2504.14239 • Published 5 days ago • 12
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation Paper • 2504.14899 • Published 3 days ago • 14
LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs Paper • 2504.14655 • Published 3 days ago • 18
EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models Paper • 2504.15133 • Published 3 days ago • 18
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs Paper • 2504.15280 • Published 2 days ago • 18
THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models Paper • 2504.13367 • Published 6 days ago • 23
StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians Paper • 2504.15281 • Published 2 days ago • 23
X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents Paper • 2504.13203 • Published 8 days ago • 27
SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation Paper • 2504.14396 • Published 4 days ago • 27