HPAI-BSC
/

Qwen2.5-7B-Instruct-Egida-DPO

Model card Files Files and versions Community

danihinjos commited on Feb 26

Commit

985a041

·

verified ·

1 Parent(s): 43b2502

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -2,7 +2,7 @@
 license: apache-2.0
 ---
-### Citation Information
 This is a fine-tuned Qwen2.5-7B-Instruct model on the [Egida-DPO-Qwen2.5-7B-Instruct](https://huggingface.co/datasets/HPAI-BSC/Egida/viewer/Egida-DPO-Qwen2.5-7B-Instruct) dataset.
@@ -11,6 +11,7 @@ Specifically, the [Egida](https://huggingface.co/datasets/HPAI-BSC/Egida/viewer/
 dataset for this model. This allows us to experiment with a DPO dataset composed by triplets < ”question”, ”chosen answer”, ”discarded answer” > which contain questions that elicit
 unsafe responses by this target model, as well as the unsafe responses produced by it.
 ```

 license: apache-2.0
 ---
+### Model Summary
 This is a fine-tuned Qwen2.5-7B-Instruct model on the [Egida-DPO-Qwen2.5-7B-Instruct](https://huggingface.co/datasets/HPAI-BSC/Egida/viewer/Egida-DPO-Qwen2.5-7B-Instruct) dataset.
 dataset for this model. This allows us to experiment with a DPO dataset composed by triplets < ”question”, ”chosen answer”, ”discarded answer” > which contain questions that elicit
 unsafe responses by this target model, as well as the unsafe responses produced by it.
+### Citation Information
 ```