Model Card for Model ID

A multilingual fine-tuned version of LLAMA-3-8B, quantized to 4 bits.

Model Details

Model Description

A multilingual fine-tuned version of LLAMA-3-8B, quantized to 4 bits. It was fine-tuned on common open-source datasets and shows improvements on multilingual tasks. Quantization was applied after fine-tuning using a standard bit-quantization technique, which reduces the compute time and memory required to run the model. The overall architecture is entirely LLAMA-3 based.
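To give a rough sense of the memory saving from 4-bit quantization, the following back-of-the-envelope sketch compares weight storage at 16 bits and at 4 bits per parameter. The helper `weight_memory_gib` and the nominal count of 8e9 parameters are assumptions made for illustration, not figures measured on this model. The real footprint will differ, since quantization constants, any layers left unquantized, activations, and the KV cache are all ignored here.

```python
def weight_memory_gib(n_params: float, bits_per_param: float) -> float:
    """Approximate memory needed to hold the weights alone, in GiB."""
    return n_params * bits_per_param / 8 / 2**30


# Assumption: nominal "8B" parameter count, used only for illustration.
N_PARAMS = 8e9

fp16 = weight_memory_gib(N_PARAMS, 16)  # half-precision baseline
int4 = weight_memory_gib(N_PARAMS, 4)   # 4-bit quantized weights

print(f"FP16: {fp16:.1f} GiB, 4-bit: {int4:.1f} GiB, ratio: {fp16 / int4:.0f}x")
# prints "FP16: 14.9 GiB, 4-bit: 3.7 GiB, ratio: 4x"
```

Under these assumptions the weights shrink by about a factor of four, which is the main reason a 4-bit model can fit on hardware that could not hold the half-precision weights.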

  • Developed by: Daniele Comi
  • Model type: LLAMA-3-8B
  • Language(s) (NLP): Multilingual
  • License: MIT
  • Finetuned from model: LLAMA-3-8B
Safetensors: model size 3.6B params; tensor types F32, FP16, U8