Model Details

This is meta-llama/Meta-Llama-3-8B quantized to 4-bit and serialized with AutoAWQ.

Details here:

Fine-tune Llama 3 on Your Computer
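A minimal sketch of loading this 4-bit AWQ checkpoint with `transformers` (which uses the `autoawq` backend for AWQ-serialized weights). The repo id below is a hypothetical placeholder, not this model's actual path; replace it with the repository you are loading from.

```python
# Assumptions: `transformers` and `autoawq` are installed, and MODEL_ID is a
# placeholder to be replaced with this model's actual Hugging Face repo id.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "your-username/Meta-Llama-3-8B-awq-4bit"  # hypothetical placeholder


def load_quantized(model_id: str = MODEL_ID):
    """Load the AWQ 4-bit checkpoint; transformers reads the quantization
    config from the repo, so no extra quantization arguments are needed."""
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
    return model, tokenizer


if __name__ == "__main__":
    model, tokenizer = load_quantized()
    inputs = tokenizer("The capital of France is", return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=20)
    print(tokenizer.decode(out[0], skip_special_tokens=True))
```

Note that AWQ inference requires a CUDA-capable GPU; the FP16 activations and INT32-packed 4-bit weights are dequantized on the fly at generation time.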

Model size: 1.98B params (safetensors)

Tensor types: FP16 · I32