Michael Goin

mgoin

mgoin_
mgoin

AI & ML interests

LLM inference optimization, compression, quantization, pruning, distillation

Recent Activity

updated a model 13 days ago

RedHatAI/gemma-4-26B-A4B-it-NVFP4

upvoted a paper 2 months ago

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

new activity 2 months ago

GadflyII/GLM-4.7-Flash-MXFP4:Update MXFP4 format to compressed-tensors


Organizations

Collections 1

Papers 4

spaces 5

redhatai-model-explorer

🐳

Browse and filter text models by RedHatAI

Generate text in a chat format

models 102

mgoin/Qwen3-0.6B-MXFP8

0.6B • Updated Feb 16 • 2

mgoin/GLM-4.6-FP8-BLOCK

Text Generation • 357B • Updated Feb 10 • 78

mgoin/Qwen3-0.6B-NVFP4

0.6B • Updated Aug 26, 2025 • 79

mgoin/mlperf-inference-llama3.1-8b-data

Updated Jul 15, 2025

mgoin/Llama-3.1-8B-Instruct-FP8-BLOCK

8B • Updated Jul 1, 2025 • 3

mgoin/SEMIKONG-70B-W4A16-G128

11B • Updated Jun 16, 2025 • 2

mgoin/llama-4-tiny-random

Text Generation • 6.69M • Updated May 14, 2025 • 4

mgoin/Qwen1.5-14B-Chat-GPTQ

Text Generation • Updated Mar 5, 2025 • 3

mgoin/pixtral-12b

Image-Text-to-Text • 13B • Updated Feb 7, 2025 • 319 • 1

mgoin/Llama-3.2-1B-Instruct-FP8-ATTN

1B • Updated Dec 23, 2024 • 1


datasets 4

mgoin/mlperf-inference-llama3.1-8b-data

Viewer • Updated Jul 15, 2025 • 13.4k • 33

mgoin/mlperf-inference-llama2-data

Viewer • Updated May 22, 2025 • 24.6k • 20

mgoin/mlperf-inference-llama3.1-405b-data

Viewer • Updated May 22, 2025 • 8.31k • 142

mgoin/ultrachat_2k

Viewer • Updated May 24, 2024 • 2.05k • 70


Collections 1

mgoin/Nemotron-4-340B-Instruct-hf-FP8

mgoin/Nemotron-4-340B-Base-hf-FP8

mgoin/Nemotron-4-340B-Instruct-hf

mgoin/Nemotron-4-340B-Base-hf



spaces 5

redhatai-model-explorer

Convert Fp8

Hermes Mistral 7b Vllm

Sparse Llama Gsm8k

TinyStories DeepSparse

