BitNet Collection 🔥BitNet family of large language models (1-bit LLMs). • 6 items • Updated about 2 hours ago • 19
RADIO Collection A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 12 items • Updated 3 days ago • 16
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 8 items • Updated 15 days ago • 117
SVDQuant Collection Models and datasets for "SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models" • 20 items • Updated Mar 17 • 25
distil-large-v3.5 Collection This collection contains the model repositories for distil-large-v3.5, which provides support for the most popular Whisper libraries. • 5 items • Updated 24 days ago • 7
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency Paper • 2503.20785 • Published 23 days ago • 21
ConsisID Collection Identity-Preserving Text-to-Video Generation by Frequency Decomposition • 4 items • Updated Dec 3, 2024 • 12
Enabling Versatile Controls for Video Diffusion Models Paper • 2503.16983 • Published 28 days ago • 14
SANA-Sprint Collection SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation • 6 items • Updated 1 day ago • 35
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation Paper • 2503.09641 • Published Mar 12 • 36
Article Welcome PaliGemma 2 – New vision language models by Google • Dec 5, 2024 • 152