Update README.md
Browse files
README.md
CHANGED
@@ -2,7 +2,7 @@
|
|
2 |
license: apache-2.0
|
3 |
---
|
4 |
|
5 |
-
###
|
6 |
|
7 |
|
8 |
This is a fine-tuned Qwen2.5-7B-Instruct model on the [Egida-DPO-Qwen2.5-7B-Instruct](https://huggingface.co/datasets/HPAI-BSC/Egida/viewer/Egida-DPO-Qwen2.5-7B-Instruct) dataset.
|
@@ -11,6 +11,7 @@ Specifically, the [Egida](https://huggingface.co/datasets/HPAI-BSC/Egida/viewer/
|
|
11 |
dataset for this model. This allows us to experiment with a DPO dataset composed by triplets < ”question”, ”chosen answer”, ”discarded answer” > which contain questions that elicit
|
12 |
unsafe responses by this target model, as well as the unsafe responses produced by it.
|
13 |
|
|
|
14 |
|
15 |
|
16 |
```
|
|
|
2 |
license: apache-2.0
|
3 |
---
|
4 |
|
5 |
+
### Model Summary
|
6 |
|
7 |
|
8 |
This is a fine-tuned Qwen2.5-7B-Instruct model on the [Egida-DPO-Qwen2.5-7B-Instruct](https://huggingface.co/datasets/HPAI-BSC/Egida/viewer/Egida-DPO-Qwen2.5-7B-Instruct) dataset.
|
|
|
11 |
dataset for this model. This allows us to experiment with a DPO dataset composed by triplets < ”question”, ”chosen answer”, ”discarded answer” > which contain questions that elicit
|
12 |
unsafe responses by this target model, as well as the unsafe responses produced by it.
|
13 |
|
14 |
+
### Citation Information
|
15 |
|
16 |
|
17 |
```
|