Tandogan/MNLP_M2_dpo_model
Safetensors
Tandogan/sft_dataset_final_train
Tandogan/MNLP_M2_dpo_dataset
qwen3
dpo
unsloth
trl
qwen
instruction-tuning
preference-modeling
mnlp
License: apache-2.0
main
MNLP_M2_dpo_model/tokenizer_config.json
Commit History
Upload tokenizer
97dffbe
verified
Tandogan
committed on
May 25