XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference • Paper • 2404.15420 • Published Apr 23, 2024 • 11
BASS: Batched Attention-optimized Speculative Sampling • Paper • 2404.15778 • Published Apr 24, 2024 • 11
MaGGIe: Masked Guided Gradual Human Instance Matting • Paper • 2404.16035 • Published Apr 24, 2024 • 12
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning • Paper • 2404.15449 • Published Apr 23, 2024 • 14
MotionMaster: Training-free Camera Motion Transfer For Video Generation • Paper • 2404.15789 • Published Apr 24, 2024 • 13
CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data • Paper • 2404.15653 • Published Apr 24, 2024 • 30
PuLID: Pure and Lightning ID Customization via Contrastive Alignment • Paper • 2404.16022 • Published Apr 24, 2024 • 26
MagicProp: Diffusion-based Video Editing via Motion-aware Appearance Propagation • Paper • 2309.00908 • Published Sep 2, 2023 • 6
Sequential Dexterity: Chaining Dexterous Policies for Long-Horizon Manipulation • Paper • 2309.00987 • Published Sep 2, 2023 • 4
Compositional Diffusion-Based Continuous Constraint Solvers • Paper • 2309.00966 • Published Sep 2, 2023 • 6
Contrastive Feature Masking Open-Vocabulary Vision Transformer • Paper • 2309.00775 • Published Sep 2, 2023 • 10
Doppelgangers: Learning to Disambiguate Images of Similar Structures • Paper • 2309.02420 • Published Sep 5, 2023 • 11
StyleAdapter: A Single-Pass LoRA-Free Model for Stylized Image Generation • Paper • 2309.01770 • Published Sep 4, 2023 • 12
Hierarchical Masked 3D Diffusion Model for Video Outpainting • Paper • 2309.02119 • Published Sep 5, 2023 • 12
PromptTTS 2: Describing and Generating Voices with Text Prompt • Paper • 2309.02285 • Published Sep 5, 2023 • 13