Update README.md
Browse files
    	
        README.md
    CHANGED
    
    | @@ -11,7 +11,7 @@ licence: license | |
| 11 |  | 
| 12 | 
             
            # Model Card for Qwen2.5-0.5B-DPO
         | 
| 13 |  | 
| 14 | 
            -
            Fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct)  | 
| 15 |  | 
| 16 | 
             
            Video link: coming soon! <br>
         | 
| 17 | 
             
            [Blog link](https://shawhin.medium.com/fine-tuning-llms-on-human-feedback-rlhf-dpo-1c693dbc4cbf) <br>
         | 
|  | |
| 11 |  | 
| 12 | 
             
            # Model Card for Qwen2.5-0.5B-DPO
         | 
| 13 |  | 
| 14 | 
            +
            Fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) to generate YouTube titles based on my preferences.
         | 
| 15 |  | 
| 16 | 
             
            Video link: coming soon! <br>
         | 
| 17 | 
             
            [Blog link](https://shawhin.medium.com/fine-tuning-llms-on-human-feedback-rlhf-dpo-1c693dbc4cbf) <br>
         | 
