Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2140.8
TFLOPS
101
19
21
Michael Goin
mgoin
Follow
Fishtiks's profile picture
nickandbro's profile picture
shubhrapandit's profile picture
41 followers
·
12 following
mgoin_
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
updated
a model
8 days ago
RedHatAI/Llama-3.2-1B-FP8
new
activity
22 days ago
kernels-community/vllm-flash-attn3:
Support for B200s?
liked
a model
about 1 month ago
moondream/moondream3-preview
View all activity
Organizations
mgoin
's models
100
Sort: Recently updated
mgoin/Qwen3-0.6B-NVFP4
0.6B
•
Updated
Aug 26
•
2
mgoin/mlperf-inference-llama3.1-8b-data
Updated
Jul 15
mgoin/Llama-3.1-8B-Instruct-FP8-BLOCK
8B
•
Updated
Jul 1
mgoin/SEMIKONG-70B-W4A16-G128
11B
•
Updated
Jun 16
mgoin/llama-4-tiny-random
Text Generation
•
6.69M
•
Updated
May 14
mgoin/Qwen1.5-14B-Chat-GPTQ
Text Generation
•
Updated
Mar 5
mgoin/pixtral-12b
Image-Text-to-Text
•
13B
•
Updated
Feb 7
•
177
•
1
mgoin/Llama-3.2-1B-Instruct-FP8-ATTN
1B
•
Updated
Dec 23, 2024
mgoin/Llama-3.2-1B-Instruct-FP8-dynamic-ATTN
1B
•
Updated
Dec 23, 2024
mgoin/Pixtral-Large-Instruct-2411
Updated
Nov 19, 2024
•
1
mgoin/Qwen2.5-Coder-32B-Instruct-fp8
Updated
Nov 13, 2024
mgoin/nemotron-3-8b-chat-4k-sft-hf
Text Generation
•
9B
•
Updated
Nov 13, 2024
mgoin/llava-onevision-qwen2-7b-ov-hf-bnb-full-4bit
Image-to-Text
•
5B
•
Updated
Nov 5, 2024
•
4
mgoin/MiniCPM-Llama3-V-2_5-int4
Visual Question Answering
•
5B
•
Updated
Oct 31, 2024
mgoin/DeepSeek-Coder-V2-Lite-Instruct-FP8
16B
•
Updated
Sep 20, 2024
•
20
mgoin/Mixtral-8x7B-Instruct-v0.1-FP8
47B
•
Updated
Sep 20, 2024
mgoin/Nemotron-nemo-checkpoints
Updated
Aug 30, 2024
mgoin/Minitron-4B-Base-FP8
Text Generation
•
4B
•
Updated
Aug 16, 2024
•
2
•
3
mgoin/Nemotron-4-340B-Base-hf
Text Generation
•
341B
•
Updated
Aug 8, 2024
•
2
•
1
mgoin/Nemotron-4-340B-Instruct-hf-FP8
Text Generation
•
341B
•
Updated
Aug 8, 2024
•
26
•
3
mgoin/Nemotron-4-340B-Base-hf-FP8
Text Generation
•
341B
•
Updated
Aug 8, 2024
•
151
•
2
mgoin/Nemotron-4-340B-Instruct-hf
Text Generation
•
341B
•
Updated
Aug 8, 2024
•
35
•
4
mgoin/SparseLLama-2-7b-ultrachat_200k-pruned_50.2of4-compressed-tensors
4B
•
Updated
Aug 5, 2024
mgoin/Minitron-8B-Base-FP8
Text Generation
•
8B
•
Updated
Jul 26, 2024
•
3
mgoin/Nemotron-4-340B-Instruct-FP8-Dynamic
Text Generation
•
341B
•
Updated
Jul 23, 2024
mgoin/Nemotron-4-340B-Instruct-vllm
Text Generation
•
341B
•
Updated
Jul 23, 2024
mgoin/Mistral-Nemo-Instruct-2407-FP8-KV
Text Generation
•
12B
•
Updated
Jul 18, 2024
•
8
mgoin/Mistral-Nemo-Instruct-2407-FP8-Dynamic
Text Generation
•
12B
•
Updated
Jul 18, 2024
•
63
mgoin/Meta-Llama-3-8B-Instruct-ds
Text Generation
•
Updated
Jul 3, 2024
mgoin/Meta-Llama-3-8B-Instruct-pruned50-quant-ds
Text Generation
•
Updated
Jun 28, 2024
Previous
1
2
3
4
Next