etri-xainlp/llama3-8b-dpo_v1
Model Details
Model Developers ETRI xainlp team
Input text only.
Output text only.
Model Architecture
Base Model meta-llama/Llama-8b-hf
Training Dataset
sft+lora: 1,821 k instruction-following set
dpo+lora: 221 k user preference set
We use A100 GPU 80GB * 8, when training.
- Downloads last month
- 1,663
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support