FocusedAD: Character-centric Movie Audio Description Paper • 2504.12157 • Published 7 days ago • 9
In-2-4D: Inbetweening from Two Single-View Images to 4D Generation Paper • 2504.08366 • Published 12 days ago • 9
FlexIP: Dynamic Control of Preservation and Personality for Customized Image Generation Paper • 2504.07405 • Published 13 days ago • 12
LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models Paper • 2504.10430 • Published 9 days ago • 4
NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors Paper • 2504.11427 • Published 8 days ago • 17
Cobra: Efficient Line Art COlorization with BRoAder References Paper • 2504.12240 • Published 7 days ago • 27
WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments Paper • 2504.03886 • Published 19 days ago • 10
Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation Paper • 2503.24379 • Published 23 days ago • 75
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors Paper • 2504.01016 • Published 22 days ago • 29
Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals Paper • 2503.19953 • Published 29 days ago • 3
MusicInfuser: Making Video Diffusion Listen and Dance Paper • 2503.14505 • Published Mar 18 • 11
LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds Paper • 2503.10625 • Published Mar 13 • 32
OmnimatteZero: Training-free Real-time Omnimatte with Pre-trained Video Diffusion Models Paper • 2503.18033 • Published Mar 23 • 24
Can Vision-Language Models Answer Face to Face Questions in the Real-World? Paper • 2503.19356 • Published 29 days ago • 2
FRESA:Feedforward Reconstruction of Personalized Skinned Avatars from Few Images Paper • 2503.19207 • Published 30 days ago • 4
DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis Paper • 2503.15667 • Published Mar 19 • 8