DPO finetuned model

  • Developed by: choco-conoz
  • License: apache-2.0
  • DPO fine-tuned from model: unsloth/Llama-3.2-1B
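
A minimal loading sketch, assuming the `transformers` and `torch` packages are installed and that the repository `choco-conoz/TwinLlama-3.2-1B-DPO` is publicly accessible. The helper names `load_model` and `generate` are hypothetical, chosen only for illustration. Because the base model is not an instruct variant, the sketch passes a plain text prompt; the prompt format used during DPO training is not stated on this page, so treat that as an assumption.

```python
MODEL_ID = "choco-conoz/TwinLlama-3.2-1B-DPO"  # repository name as given on this page


def load_model(model_id: str = MODEL_ID):
    """Load the tokenizer and model (hypothetical helper).

    Imports are deferred so that defining this function does not
    require the libraries; calling it downloads the weights.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # BF16 matches the tensor type listed for the checkpoint.
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a continuation for a plain-text prompt (hypothetical helper)."""
    tokenizer, model = load_model()
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Write a short paragraph about preference tuning.")` would download the weights on first use and return the decoded text, prompt included.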
  • Weights format: Safetensors
  • Model size: 1.24B params
  • Tensor type: BF16
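
As a rough illustration of what the listed size and tensor type imply, the sketch below estimates the memory needed for the weights alone. It assumes 2 bytes per BF16 parameter and ignores activations, the KV cache, and framework overhead; the names `PARAMS` and `weight_bytes` are hypothetical.

```python
PARAMS = 1.24e9            # parameter count listed for the checkpoint
BYTES_PER_PARAM_BF16 = 2   # bfloat16 stores each value in 16 bits


def weight_bytes(params: float, bytes_per_param: int) -> float:
    """Return the approximate size of the weights in bytes."""
    return params * bytes_per_param


weights_gb = weight_bytes(PARAMS, BYTES_PER_PARAM_BF16) / 1e9
print(f"Approximate weight memory: {weights_gb:.2f} GB")
```

Under these assumptions the weights take roughly 2.48 GB, so actual inference needs somewhat more than that.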

