Michael Goin
mgoin
29 followers · 11 following
mgoin_
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
Published a model 2 days ago: neuralmagic/Qwen2.5-3B-quantized.w8a8
Published a model 2 days ago: neuralmagic/Qwen2.5-14B-quantized.w8a8
Published a model 2 days ago: neuralmagic/Qwen2.5-14B-FP8-dynamic
mgoin's activity
Published 19 models 2 days ago
neuralmagic/Qwen2.5-3B-quantized.w8a8 • Text Generation • Updated Dec 3, 2024 • 5
neuralmagic/Qwen2.5-14B-quantized.w8a8 • Text Generation • Updated Dec 3, 2024 • 9
neuralmagic/Qwen2.5-14B-FP8-dynamic • Text Generation • Updated Dec 3, 2024 • 15
neuralmagic/Qwen2.5-72B-FP8-dynamic • Text Generation • Updated Dec 3, 2024 • 2
neuralmagic/Qwen2.5-3B-FP8-dynamic • Text Generation • Updated Dec 3, 2024 • 4
neuralmagic/Qwen2.5-0.5B-FP8-dynamic • Text Generation • Updated Dec 3, 2024 • 2
neuralmagic/Qwen2.5-72B-quantized.w8a16 • Text Generation • Updated Nov 26, 2024 • 4
neuralmagic/Qwen2.5-32B-quantized.w8a16 • Text Generation • Updated Nov 26, 2024 • 4
neuralmagic/Qwen2.5-7B-quantized.w8a16 • Text Generation • Updated Nov 26, 2024 • 4
neuralmagic/Qwen2.5-1.5B-quantized.w8a16 • Text Generation • Updated Nov 26, 2024 • 4
neuralmagic/Qwen2.5-0.5B-Instruct-quantized.w8a8 • Text Generation • Updated Dec 9, 2024 • 2
neuralmagic/Qwen2.5-3B-quantized.w8a16 • Text Generation • Updated Nov 26, 2024 • 4
neuralmagic/Qwen2.5-72B-quantized.w8a8 • Text Generation • Updated Dec 3, 2024 • 4
neuralmagic/Qwen2.5-7B-quantized.w8a8 • Text Generation • Updated Dec 3, 2024 • 2
neuralmagic/Qwen2.5-1.5B-quantized.w8a8 • Text Generation • Updated Dec 3, 2024 • 2
neuralmagic/Qwen2.5-0.5B-quantized.w8a8 • Text Generation • Updated Dec 3, 2024 • 2
neuralmagic/Qwen2.5-0.5B-quantized.w8a16 • Text Generation • Updated Nov 26, 2024 • 4
neuralmagic/Qwen2.5-1.5B-FP8-dynamic • Text Generation • Updated Dec 3, 2024 • 4
neuralmagic/Qwen2.5-7B-FP8-dynamic • Text Generation • Updated Dec 3, 2024 • 4

Updated a model 2 days ago
nm-testing/whisper-large-v2-FP8-dynamic • Updated 2 days ago • 4