Hugging Face
Models: 1,879
Sort: Trending
Active filters: compressed-tensors
clejordan/MNLP_M3_W4A16llmcompressor_manysamples • Text Generation • Updated 6 days ago • 14
ema1234/qwen_mcqa_compressed_gptq_4bit • Text Generation • Updated 6 days ago • 25
ema1234/qwen_mcqa_compressed_smoothquant_int8 • Text Generation • Updated 6 days ago • 7
ema1234/qwen_mcqa_compressed_sparseml_4bit • Text Generation • Updated 6 days ago • 5
nm-testing/TinyLlama-1.1B-Chat-v1.0-NVFP4 • Updated 6 days ago • 150
HAissa/M3_smoothquant04 • Text Generation • Updated 6 days ago • 12
sajal09/Packed_W4_ASymmetric_Channel_A8_Symmetric_Token • Text Generation • Updated 6 days ago • 1
HAissa/M3_smoothquant05 • Text Generation • Updated 6 days ago • 31
HAissa/M3_smoothquant06 • Text Generation • Updated 6 days ago • 9
sajal09/W4_ASymmetric_Channel_A8_Symmetric_Token • Text Generation • Updated 6 days ago • 13
sajal09/W4_Symmetric_Group_128_A8_Symmetric_Token • Text Generation • Updated 6 days ago • 9
luciehmct/MNLP_M3_quantized_model_W4A8 • Text Generation • Updated 6 days ago • 15
HAissa/M3_smoothquant07 • Text Generation • Updated 6 days ago • 9
luciehmct/MNLP_M3_quantized_model_W6A4 • Text Generation • Updated 6 days ago • 14
HAissa/M3_smoothquant09 • Text Generation • Updated 6 days ago • 23
sajal09/W4_Symmetric_Group_256_A8_Symmetric_Token • Text Generation • Updated 6 days ago • 9
sajal09/W4_Symmetric_Tensor_A8_Symmetric_Token • Text Generation • Updated 6 days ago • 8
HAissa/M3_smoothquant092 • Text Generation • Updated 6 days ago • 72
boboliu/Qwen3-Embedding-0.6B-W4A16-G128 • Feature Extraction • Updated 5 days ago • 39
RasmusVeski/OLD_llmcompressor_PTQW4A16 • Text Generation • Updated 6 days ago • 8
boboliu/Qwen3-Embedding-8B-W4A16-G128 • Feature Extraction • Updated 5 days ago • 242
RasmusVeski/OLD_llmcompressor_PTQW8A8 • Text Generation • Updated 6 days ago • 8
RasmusVeski/OLD_quantized_model_GPTQW8A16 • Text Generation • Updated 6 days ago • 7
boboliu/Qwen3-Embedding-4B-W4A16-G128 • Feature Extraction • Updated 5 days ago • 76
yassineturki/M3_gptq_only • Text Generation • Updated 6 days ago • 37
sajal09/W8_Symmetric_Channel_A8_Symmetric_Token • Text Generation • Updated 6 days ago • 9
sajal09/WithoutSmoothing • Text Generation • Updated 6 days ago • 38
RasmusVeski/MNLP_M3_quantized_model_GPTQW8A16 • Text Generation • Updated 6 days ago • 12
HAissa/M3_smoothquant094 • Text Generation • Updated 6 days ago • 12
tranhuonglan/Qwen3-06B-base-quantization-modifier-W4A8 • Text Generation • Updated 6 days ago • 3