Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

17

Full-text search

Active filters: efficiency

Shahradmz/HyenaDistilledPythia70M

Text Generation • Updated Jan 10, 2024

sapienzanlp/maverick-mes-litbank

Updated Aug 12, 2024 • 7 • 4

1-800-LLMs/Qwen-2.5-14B-Hindi

15B • Updated Feb 13 • 2

1024m/PHI-4-Hindi

15B • Updated Feb 13 • 1

1024m/PHI-4-Hindi-LoRA

large-traversaal/Mantra-14B

15B • Updated Apr 13 • 8 • 2

DrishtiSharma/qwen-2.5-14b

large-traversaal/Qwen-2.5-14B-Hindi

15B • Updated Mar 3 • 12 • 4

mradermacher/Qwen-2.5-14B-Hindi-GGUF

15B • Updated Mar 3 • 99 • 1

sst12345/CoRe2

Text-to-Image • Updated Mar 18 • 2

mradermacher/Mantra-14B-GGUF

15B • Updated 18 days ago • 129

mradermacher/Mantra-14B-i1-GGUF

15B • Updated 18 days ago • 196

codelion/Qwen3-0.6B-accuracy-recovery-lora

Text Generation • Updated 16 days ago • 8 • 1

prompterminal/gpt2-compressed

Text Generation • 2B • Updated 7 days ago • 7

GY2233/R2R_router_qwen3-1.7b

Text Classification • Updated 7 days ago • 2

GY2233/R2R_router_qwen3-4b

Text Classification • Updated 6 days ago • 4

GY2233/R2R_router_qwenr1

Text Classification • Updated 5 days ago • 3