Michael Goin

mgoin

AI & ML interests

LLM inference optimization, compression, quantization, pruning, distillation

Organizations

Neural Magic, garage-bAInd, Blog-explorers, Mistral AI_, ZeroGPU Explorers, NM Testing, Red Hat AI

mgoin's activity

New activity in RedHatAI/Qwen2.5-VL-72B-Instruct-quantized.w8a8 about 2 months ago
New activity in RedHatAI/Qwen2.5-VL-72B-Instruct-FP8-Dynamic about 2 months ago
New activity in RedHatAI/Meta-Llama-3-8B-Instruct-FP8-KV 3 months ago

How to load this model?

2
#1 opened 10 months ago by Frz614

Thanks!

1
#2 opened 4 months ago by Jindows