shawhin commited on
Commit
3e198d0
·
verified ·
1 Parent(s): a6f679a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -11,7 +11,7 @@ licence: license
11
 
12
  # Model Card for Qwen2.5-0.5B-DPO
13
 
14
- Fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) on my YouTube title preferences via DPO.
15
 
16
  Video link: coming soon! <br>
17
  [Blog link](https://shawhin.medium.com/fine-tuning-llms-on-human-feedback-rlhf-dpo-1c693dbc4cbf) <br>
 
11
 
12
  # Model Card for Qwen2.5-0.5B-DPO
13
 
14
+ Fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) to generate YouTube titles based on my preferences.
15
 
16
  Video link: coming soon! <br>
17
  [Blog link](https://shawhin.medium.com/fine-tuning-llms-on-human-feedback-rlhf-dpo-1c693dbc4cbf) <br>