Michael Goin's picture

Michael Goin PRO

mgoin

AI & ML interests

LLM inference optimization, compression, quantization, pruning, distillation

Recent Activity

published a model 2 days ago
neuralmagic/Qwen2.5-3B-quantized.w8a8
published a model 2 days ago
neuralmagic/Qwen2.5-14B-quantized.w8a8
published a model 2 days ago
neuralmagic/Qwen2.5-14B-FP8-dynamic
View all activity

Organizations

Neural Magic's profile picture garage-bAInd's profile picture Blog-explorers's profile picture Revel Labs's profile picture ZeroGPU Explorers's profile picture NM Testing's profile picture MLX Community's profile picture Social Post Explorers's profile picture Red Hat's profile picture

mgoin's activity

How to load this model?

2
#1 opened 7 months ago by
Frz614

Model does not run with VLLM

2
#3 opened about 1 month ago by
aswad546

Thanks!

#2 opened about 2 months ago by
Jindows
New activity in neuralmagic/pixtral-12b-FP8-dynamic 3 months ago

Update model card

#1 opened 3 months ago by
nm-research

Oom with 24g vram

3
#1 opened 4 months ago by
Klopez