Collections
Discover the best community collections!
Collections trending this week
-
- InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
  Paper • 2504.10479 • Published • 222
- OpenGVLab/InternVL3-1B
  Image-Text-to-Text • Updated • 7.43k • 41
- OpenGVLab/InternVL3-2B
  Image-Text-to-Text • Updated • 2.97k • 13
- OpenGVLab/InternVL3-8B
  Image-Text-to-Text • Updated • 11.5k • 33
-
- microsoft/bitnet-b1.58-2B-4T
  Text Generation • Updated • 4.62k • 393
- microsoft/bitnet-b1.58-2B-4T-bf16
  Text Generation • Updated • 385 • 10
- microsoft/bitnet-b1.58-2B-4T-gguf
  Text Generation • Updated • 4.24k • 72
- BitNet b1.58 2B4T Technical Report
  Paper • 2504.12285 • Published • 47
-
- meta-llama/Llama-4-Scout-17B-16E-Instruct
  Image-Text-to-Text • Updated • 696k • 793
- meta-llama/Llama-4-Scout-17B-16E
  Image-Text-to-Text • Updated • 27.3k • 149
- meta-llama/Llama-4-Maverick-17B-128E-Instruct
  Image-Text-to-Text • Updated • 43.9k • 297
- meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
  Image-Text-to-Text • Updated • 40.4k • 101
-
- google/gemma-3-4b-it-qat-q4_0-gguf
  Image-Text-to-Text • Updated • 9.46k • 91
- google/gemma-3-4b-pt-qat-q4_0-gguf
  Image-Text-to-Text • Updated • 412 • 14
- google/gemma-3-1b-it-qat-q4_0-gguf
  Text Generation • Updated • 3.28k • 27
- google/gemma-3-1b-pt-qat-q4_0-gguf
  Text Generation • Updated • 160 • 6
-
- Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
  Paper • 2504.03624 • Published • 13
- nvidia/Nemotron-H-56B-Base-8K
  Text Generation • Updated • 348 • 24
- nvidia/Nemotron-H-47B-Base-8K
  Text Generation • Updated • 324 • 14
- nvidia/Nemotron-H-8B-Base-8K
  Text Generation • Updated • 2.91k • 33
-
- ibm-granite/granite-3.3-2b-base
  Text Generation • Updated • 213 • 4
- ibm-granite/granite-3.3-2b-instruct
  Text Generation • Updated • 21.1k • 16
- ibm-granite/granite-3.3-8b-base
  Text Generation • Updated • 182 • 7
- ibm-granite/granite-3.3-8b-instruct
  Text Generation • Updated • 2.43k • 42