Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

202

Full-text search

Active filters: torchao

medmekk/Llama-3.2-1B-ao-float8da8w

Text Generation • Updated Apr 22 • 13

medmekk/Llama-3.2-1B-ao-autoquant-1

Text Generation • Updated Apr 22 • 13

medmekk/Llama-3.2-1B-ao-float8wo-2

Text Generation • Updated Apr 22 • 13

medmekk/Llama-3.2-1B-ao-float8wo-3

Text Generation • Updated Apr 22 • 13

medmekk/Llama-3.2-1B-ao-int8wo-gs256

Text Generation • Updated Apr 22 • 11

medmekk/Llama-3.2-1B-ao-int4wo-gs128

Text Generation • Updated Apr 22 • 3

medmekk/Qwen2.5-0.5B-Instruct-ao-float8wo

Text Generation • Updated Apr 22 • 14

medmekk/Llama-3.2-1B-ao-int4wo-gs256

Text Generation • Updated Apr 22 • 2

medmekk/Qwen2.5-VL-7B-Instruct-ao-float8wo

Updated Apr 24 • 4

medmekk/Qwen2.5-VL-7B-Instruct-ao-int8wo

Updated Apr 24 • 3

medmekk/Llama-3.1-8B-Instruct-ao-int8wo

Text Generation • Updated Apr 24 • 12

medmekk/Qwen2.5-VL-7B-Instruct-ao-int8da8w8

Updated Apr 24 • 3

medmekk/Llama-3.1-8B-Instruct-ao-autoquant

Text Generation • Updated Apr 24 • 12

medmekk/Llama-3.1-8B-Instruct-ao-int4wo-gs128

Text Generation • Updated Apr 24 • 5

medmekk/Llama-3.1-8B-Instruct-ao-float8wo

Text Generation • Updated Apr 24 • 14

medmekk/Llama-3.1-8B-Instruct-ao-float8da8w8

Text Generation • Updated Apr 24 • 13

medmekk/Llama-3.1-8B-Instruct-ao-int8da8w8

Text Generation • Updated Apr 24 • 12

medmekk/Llama-3.1-8B-Instruct-ao-float8da8w8-2

Text Generation • Updated Apr 24 • 13

medmekk/Llama-3.1-8B-Instruct-ao-int4wo-gs32

Text Generation • Updated Apr 24 • 2

medmekk/Llama-3.1-8B-Instruct-ao-int4wo-gs16

Text Generation • Updated Apr 24 • 3

Erland/vanilla-340M-4096-model-AO-W4

Text Generation • Updated May 21 • 13

irresistiblegrace97/TinyLlama-1.1B-Chat-v1.0-torchao-int4_weight_only-gs_4096

Updated Apr 24 • 1

Erland/softpick-340M-4096-model-AO-W4

Text Generation • Updated May 21 • 13

Erland/softpick-340M-4096-model-AO-W4A4

Text Generation • Updated May 21 • 13

Erland/vanilla-340M-4096-model-AO-W4A4

Text Generation • Updated May 21 • 13

irresistiblegrace97/tinyllama.gguf

Updated Apr 24 • 2

jerryzh168/opt-125m-int4wo

Text Generation • Updated Apr 25 • 3

pytorch/Qwen3-8B-int4wo-hqq

Text Generation • Updated 6 days ago • 50

pytorch/Qwen3-32B-float8dq

Text Generation • Updated May 29 • 19

jerryzh168/opt-125m-int4wo-per-module

Text Generation • Updated May 29 • 912