tianyil1/MistralForCausalLM_Cal_DPO
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment_handbook-handbook
Generated from Trainer
Eval Results
arxiv: 5 papers
License: apache-2.0
Commit History
7fba330  fix the format issue (skyoneliu, committed on Feb 5)
d5cb610  fix the format issue (skyoneliu, committed on Feb 5)
979ec4f  fix the format issue (skyoneliu, committed on Feb 5)
f9d20f6  fix the type error (skyoneliu, committed on Feb 5)
b6b31da  update the readme with evaluation (skyoneliu, committed on Feb 5)
e5b20e8  Update README.md (tianyil1, verified, committed on Feb 5)
642d91b  Update README.md (tianyil1, verified, committed on Jan 25)
b968f5e  init commit (skyoneliu, committed on Jan 25)
fc99b34  initial commit (tianyil1, verified, committed on Jan 25)