Models: 2,414

Active filters: quantized

vlad-m-dev/distiluse-base-multilingual-v2-merged-onnx

Feature Extraction • Updated Jun 22 • 1

mehta/CooperLM-354M-4bit

Text Generation • 0.2B • Updated Jun 21 • 4 • 1

steampunque/Mistral-Small-3.2-24B-Instruct-2506-Hybrid-GGUF

0.4B • Updated Jun 21 • 25

ReallyFloppyPenguin/Polaris-4B-Preview-GGUF

4B • Updated Jun 23 • 50

ReallyFloppyPenguin/Arch-Agent-7B-GGUF

8B • Updated Jun 23 • 35

ReallyFloppyPenguin/Nanonets-OCR-s-GGUF

3B • Updated Jun 23 • 101

kanrishaurus/llama3-8b-sahabatai-v1-instruct-GGUF

Text Generation • 8B • Updated Jun 23 • 28

steampunque/Qwen2.5-VL-7B-Instruct-Hybrid-GGUF

0.7B • Updated Jun 24 • 7

TheMelonGod/Jan-nano-exl2

Text Generation • Updated 30 days ago • 17

NVFP4/DeepSeek-Prover-V2-7B-FP4

4B • Updated 6 days ago • 4

NVFP4/DeepSeek-R1-0528-Qwen3-8B-FP4

5B • Updated 6 days ago • 5

NVFP4/Qwen3-32B-FP4

Text Generation • 19B • Updated 6 days ago • 16

NVFP4/Polaris-4B-Preview-FP4

2B • Updated 6 days ago • 19

NVFP4/Polaris-7B-Preview-FP4

5B • Updated 6 days ago • 3

hdtrnk/Wan2.1_Phantom_FusioniX

Image-to-Video • Updated Jun 25 • 2

PinkPixel/Crystal-Think-V2-GGUF

Text Generation • 4B • Updated Jun 26 • 49 • 1

PinkPixel/Crystal-Think-V2-Imatrix-GGUF

Text Generation • 4B • Updated Jun 26 • 74 • 1

muranAI/DeepSeek-R1-0528-Qwen3-8B-GGUF

8B • Updated Jun 28 • 394 • 1

hrsvrn/Flux.1-kontext-dev-gguf

12B • Updated Jun 26 • 31

onnx-community/distiluse-base-multilingual-v2-merged-onnx

Feature Extraction • Updated Jun 26 • 1

muranAI/gemma-3n-E4B-it-GGUF

Text Generation • 7B • Updated Jun 28 • 1.9k • 2

Durlabh/ai-fitness-qwen

Text Generation • 2B • Updated about 1 month ago • 42

agentlans/gemma-3-4b-it-GGUF

4B • Updated Jun 27 • 44

lym00/Wan2.1_T2V_1.3B_SelfForcing_VACE-GGUF

Image-to-Video • 2B • Updated Jun 28 • 827 • 2

muranAI/Mistral-Small-3.1-24B-Instruct-2503-GGUF

Text Generation • 24B • Updated Jun 28 • 558 • 1

vnyaryan/playwright4model_q4_k_m

Text Generation • 3B • Updated Jun 28 • 18

vnyaryan/playwright5model_q4_k_m

Text Generation • 3B • Updated Jun 28 • 15

agentlans/Qwen3-4B-multilingual-sft

4B • Updated about 1 month ago • 6

agentlans/Qwen3-4B-multilingual-sft-GGUF

Text Generation • 4B • Updated about 1 month ago • 55

TheMelonGod/Irixxed-Magcap-12B-Slerp-exl2

Text Generation • Updated 29 days ago • 7