JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching Paper • 2506.23552 • Published 23 days ago • 9
Seeing Voices: Generating A-Roll Video from Audio with Mirage Paper • 2506.08279 • Published Jun 9 • 28
Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features Paper • 2504.00557 • Published Apr 1 • 15 • 2
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation Paper • 2503.09641 • Published Mar 12 • 40
SANA-Sprint Collection SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation • 6 items • Updated Apr 17 • 43
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization Paper • 2412.17739 • Published Dec 23, 2024 • 42
FastVLM: Efficient Vision Encoding for Vision Language Models Paper • 2412.13303 • Published Dec 17, 2024 • 21
FashionComposer: Compositional Fashion Image Generation Paper • 2412.14168 • Published Dec 18, 2024 • 16