etri-xainlp/llama2-13b-lima-sft-dpo

Model Details

Model Developers ETRI xainlp team

Input text only.

Output text only.

Model Architecture

Base Model meta-llama/Llama-13b-hf

Training Dataset

  • fully sft: 650k instruction-following set

  • lima sft: 280k instruction-following set

  • dpo+lora: 90k user preference set

  • We use A100 GPU 80GB * 7, when training.

Downloads last month
3
Safetensors
Model size
13B params
Tensor type
FP16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for etri-xainlp/llama2-13b-lima-sft-dpo

Quantizations
3 models

Spaces using etri-xainlp/llama2-13b-lima-sft-dpo 6