Llama-3.1-8B-Instruct-GGUF / README.md

base_model: meta-llama/Llama-3.1-8B-Instruct
language:
  - en
  - de
  - fr
  - it
  - pt
  - hi
  - es
  - th
tags:
  - transformers
  - safetensors
  - llama
  - text-generation
  - facebook
  - meta
  - pytorch
  - llama-3
  - conversational
  - en
  - de
  - fr
  - it
  - pt
  - hi
  - es
  - th
  - arxiv:2204.05149
  - base_model:meta-llama/Llama-3.1-8B
  - base_model:finetune:meta-llama/Llama-3.1-8B
  - license:llama3.1
  - autotrain_compatible
  - text-generation-inference
  - endpoints_compatible
  - region:us
license: llama3.1
inference: false
quantized_by: pbatra

Llama-3.1-8B-Instruct

This repository contains GGUF quantizations of the original model, meta-llama/Llama-3.1-8B-Instruct.

| Name | Quantization Method | Size (GB) |
| --- | --- | --- |
| llama-3.1-8b-instruct.Q8_0.gguf | q8_0 | 7.95 |
| llama-3.1-8b-instruct.Q4_0.gguf | q4_0 | 4.34 |
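
As a rough illustration of how the files listed above might be selected and fetched, the sketch below picks the largest quantization that fits a given size budget and downloads it with `huggingface_hub`. The repository id `pbatra/Llama-3.1-8B-Instruct-GGUF` is an assumption pieced together from the `quantized_by` field and the repository name, and `pick_quant` and `download` are hypothetical helper names, not part of any library. The sizes are file sizes only; actual memory use at inference time will be higher.

```python
from typing import Optional

# File names and sizes copied from the table in this README.
QUANT_FILES = {
    "llama-3.1-8b-instruct.Q8_0.gguf": 7.95,
    "llama-3.1-8b-instruct.Q4_0.gguf": 4.34,
}

# Assumption: this README does not state the full repository id; adjust as needed.
REPO_ID = "pbatra/Llama-3.1-8B-Instruct-GGUF"


def pick_quant(budget_gb: float) -> Optional[str]:
    """Return the largest listed file whose size fits within budget_gb, or None."""
    fitting = {name: size for name, size in QUANT_FILES.items() if size <= budget_gb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


def download(filename: str, repo_id: str = REPO_ID) -> str:
    """Download one GGUF file from the Hub and return its local cache path."""
    # Imported lazily so pick_quant works without huggingface_hub installed.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(repo_id=repo_id, filename=filename)
```

For example, `download(pick_quant(6.0))` would fetch the Q4_0 file, since the Q8_0 file exceeds a 6 GB budget. Q8_0 stays closer to the original weights, while Q4_0 trades some quality for a smaller file.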