etri-xainlp
/

llama3-8b-dpo_v1

Text Generation

text-generation-inference

Model card Files Files and versions Community

etri-xainlp/llama3-8b-dpo_v1

Model Details

Model Developers ETRI xainlp team

Input text only.

Output text only.

Model Architecture

Base Model meta-llama/Llama-8b-hf

Training Dataset

sft+lora: 1,821 k instruction-following set
dpo+lora: 221 k user preference set
We use A100 GPU 80GB * 8, when training.

Downloads last month: 1,320

Safetensors

Model size

8.03B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for etri-xainlp/llama3-8b-dpo_v1

Quantizations

1 model

Spaces using etri-xainlp/llama3-8b-dpo_v1 7