KatoHinata's picture

32 9

KatoHinata

KatoHinata

AI & ML interests

None yet

Organizations

None yet

upvoted 10 papers 4 months ago

XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference

Paper • 2404.15420 • Published Apr 23, 2024 • 11

BASS: Batched Attention-optimized Speculative Sampling

Paper • 2404.15778 • Published Apr 24, 2024 • 11

MaGGIe: Masked Guided Gradual Human Instance Matting

Paper • 2404.16035 • Published Apr 24, 2024 • 12

ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning

Paper • 2404.15449 • Published Apr 23, 2024 • 14

MotionMaster: Training-free Camera Motion Transfer For Video Generation

Paper • 2404.15789 • Published Apr 24, 2024 • 13

Editable Image Elements for Controllable Synthesis

Paper • 2404.16029 • Published Apr 24, 2024 • 12

MoDE: CLIP Data Experts via Clustering

Paper • 2404.16030 • Published Apr 24, 2024 • 15

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

Paper • 2404.15653 • Published Apr 24, 2024 • 30

PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Paper • 2404.16022 • Published Apr 24, 2024 • 26

Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published Apr 21 • 159

upvoted 10 papers 6 months ago

MagicProp: Diffusion-based Video Editing via Motion-aware Appearance Propagation

Paper • 2309.00908 • Published Sep 2, 2023 • 6

Diffusion Generative Inverse Design

Paper • 2309.02040 • Published Sep 5, 2023 • 5

Sequential Dexterity: Chaining Dexterous Policies for Long-Horizon Manipulation

Paper • 2309.00987 • Published Sep 2, 2023 • 4

Compositional Diffusion-Based Continuous Constraint Solvers

Paper • 2309.00966 • Published Sep 2, 2023 • 6

Gated recurrent neural networks discover attention

Paper • 2309.01775 • Published Sep 4, 2023 • 10

Contrastive Feature Masking Open-Vocabulary Vision Transformer

Paper • 2309.00775 • Published Sep 2, 2023 • 10

Doppelgangers: Learning to Disambiguate Images of Similar Structures

Paper • 2309.02420 • Published Sep 5, 2023 • 11

StyleAdapter: A Single-Pass LoRA-Free Model for Stylized Image Generation

Paper • 2309.01770 • Published Sep 4, 2023 • 12

Hierarchical Masked 3D Diffusion Model for Video Outpainting

Paper • 2309.02119 • Published Sep 5, 2023 • 12

PromptTTS 2: Describing and Generating Voices with Text Prompt

Paper • 2309.02285 • Published Sep 5, 2023 • 13