Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Bmingg
/
DPO_vinai_v2_b01_constrained_bleu_1epoch_lr2e4
like
0
Transformers
Safetensors
mbart
text2text-generation
trl
dpo
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
DPO_vinai_v2_b01_constrained_bleu_1epoch_lr2e4
/
tokenizer.json
Bmingg
Upload tokenizer
49ed4b2
verified
20 days ago
raw
Copy download link
history
contribute
delete
Safe
4.55 MB
File too large to display, you can
check the raw version
instead.