Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
neuralmagic
's Collections
DeepSeek-R1-Distill Quantized
Granite 3.1 Quantization
Sparse-Llama-3.1-2of4
Vision Language Models Quantization
FP8 LLMs for vLLM
Llama-3.2 Quantization
Llama-3.1 Quantization
INT8 LLMs for vLLM
INT4 LLMs for vLLM
Sparse Foundational Llama 2 Models
Compression Papers
DeepSparse Sparse LLMs
Sparse Finetuning MPT
Compressed LLMs from the Community
Granite 3.1 Quantization
updated
Jan 24
Upvote
-
RedHatAI/granite-3.1-2b-instruct-quantized.w4a16
Text Generation
•
0.5B
•
Updated
Feb 28
•
44
RedHatAI/granite-3.1-2b-instruct-quantized.w8a8
Text Generation
•
3B
•
Updated
Feb 28
•
1.12k
RedHatAI/granite-3.1-8b-instruct-quantized.w4a16
Text Generation
•
1B
•
Updated
May 30
•
1.13k
•
1
RedHatAI/granite-3.1-8b-instruct-quantized.w8a8
Text Generation
•
8B
•
Updated
May 30
•
2.02k
•
1
RedHatAI/granite-3.1-2b-instruct-FP8-dynamic
Text Generation
•
3B
•
Updated
Jan 28
•
7
RedHatAI/granite-3.1-8b-instruct-FP8-dynamic
Text Generation
•
8B
•
Updated
May 30
•
168
•
1
RedHatAI/granite-3.1-2b-base-quantized.w4a16
Text Generation
•
0.5B
•
Updated
Feb 28
•
43
RedHatAI/granite-3.1-2b-base-quantized.w8a8
Text Generation
•
3B
•
Updated
Feb 28
•
1.09k
RedHatAI/granite-3.1-8b-base-FP8-dynamic
Text Generation
•
8B
•
Updated
Feb 20
•
7
RedHatAI/granite-3.1-2b-base-FP8-dynamic
Text Generation
•
3B
•
Updated
Jan 30
•
13
RedHatAI/granite-3.1-8b-base-quantized.w4a16
Text Generation
•
1B
•
Updated
May 30
•
86
RedHatAI/granite-3.1-8b-base-quantized.w8a8
Text Generation
•
8B
•
Updated
Feb 28
•
1.09k
Upvote
-
Share collection
View history
Collection guide
Browse collections