etri-xainlp
/

kor-llama2-13b-dpo

Text Generation

text-generation-inference

Model card Files Files and versions Community

kor-llama2-13b-dpo / README.md

etri-xainlp's picture

Update README.md

c3ed617 verified 12 months ago

|

history blame contribute delete

446 Bytes

	---
	license: cc-by-nc-4.0
	---
	# etri-xainlp/kor-llama2-13b-dpo

	## Model Details

	Model Developers ETRI xainlp team

	Input text only.

	Output text only.

	Model Architecture

	Base Model [meta-llama/Llama-13b-hf](https://huggingface.co/meta-llama/Llama-2-13b-hf)

	Training Dataset

	- sft+lora: 1,821 k instruction-following set

	- dpo+lora: 221 k user preference set

	- We use A100 GPU 80GB * 8, when training.