metadata
base_model: meta-llama/Llama-3.1-8B-Instruct
language:
- en
- de
- fr
- it
- pt
- hi
- es
- th
tags:
- transformers
- safetensors
- llama
- text-generation
- facebook
- meta
- pytorch
- llama-3
- conversational
- en
- de
- fr
- it
- pt
- hi
- es
- th
- arxiv:2204.05149
- base_model:meta-llama/Llama-3.1-8B
- base_model:finetune:meta-llama/Llama-3.1-8B
- license:llama3.1
- autotrain_compatible
- text-generation-inference
- endpoints_compatible
- region:us
license: llama3.1
inference: false
quantized_by: pbatra
Llama-3.1-8B-Instruct
This repository contains quantized versions of the model from the original repository: meta-llama/Llama-3.1-8B-Instruct.
Name | Quantization Method | Size (GB) |
---|---|---|
llama-3.1-8b-instruct.Q8_0.gguf | q8_0 | 7.95 |
llama-3.1-8b-instruct.Q4_0.gguf | q4_0 | 4.34 |