Active filters: llamacpp
DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters
kroonen/neural-chat-7b-v3-1-GGUF • 7B • Updated • 22
Druvith/mistralmed-7b-v1.5.gguf • 7B • Updated • 5
rxavier/Taurus-7B-1.0-GGUF • 7B • Updated • 54
BramVanroy/GEITje-7B-ultra-GGUF • 7B • Updated • 320 • 9
Vikhrmodels/it-5.3-fp16-32k-GGUF • 8B • Updated • 301 • 2
rubra-ai/Meta-Llama-3-8B-Instruct-GGUF • 9B • Updated • 106 • 4
Vikhrmodels/it-5.4-fp16-orpo-v2-GGUF • 8B • Updated • 284 • 4
Dracones/gemma-2-9b-it-GGUF • Text Generation • 9B • Updated • 8
Dracones/gemma-2-27b-it-GGUF • Text Generation • 27B • Updated • 64
Vikhrmodels/Vikhr-Gemma-2B-instruct-GGUF • Text Generation • 3B • Updated • 1.54k • 16
flowaicom/Flow-Judge-v0.1-GGUF • Text Generation • 4B • Updated • 60 • 9
Vikhrmodels/Vikhr-Llama-3.2-1B-instruct-GGUF • Text Generation • 1B • Updated • 901 • 12
Vikhrmodels/Vikhr-Qwen-2.5-0.5B-instruct-GGUF • Text Generation • 0.5B • Updated • 245 • 6
Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO_GGUF • 2B • Updated • 235 • 9
Vikhrmodels/QVikhr-2.5-1.5B-Instruct-r_GGUF • 2B • Updated • 285 • 4
vicharai/ViCoder-html-32B-preview-GGUF • Text Generation • 33B • Updated • 68 • 4
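Every repository listed above publishes GGUF weights, so any of them can be fetched from the Hub and loaded locally with llama.cpp. Below is a minimal sketch using huggingface_hub and the llama-cpp-python bindings; the repo_id is taken from the list, but the exact .gguf filename inside the repository is an assumption and should be checked on the model page before running.

```python
# Minimal sketch: download a GGUF file from one of the repositories listed
# above and run a single completion with llama-cpp-python.
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

repo_id = "Vikhrmodels/Vikhr-Llama-3.2-1B-instruct-GGUF"
# Assumed filename for illustration only; verify the actual quantization
# available under the repository's "Files" tab.
filename = "Vikhr-Llama-3.2-1B-instruct-Q4_K_M.gguf"

# Download the quantized weights (or reuse the local cache).
model_path = hf_hub_download(repo_id=repo_id, filename=filename)

# Load the model with a modest context window and generate a short completion.
llm = Llama(model_path=model_path, n_ctx=2048)
output = llm("Q: What is the GGUF format used for? A:", max_tokens=64)
print(output["choices"][0]["text"])
```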