etri-xainlp/kor-llama2-13b-dpo

Model Details

Model Developers ETRI xainlp team

Input text only.

Output text only.

Model Architecture

Base Model meta-llama/Llama-13b-hf

Training Dataset

  • sft+lora: 1,821 k instruction-following set

  • dpo+lora: 221 k user preference set

  • We use A100 GPU 80GB * 8, when training.

Downloads last month
1,716
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support