Update README.md
README.md
CHANGED
@@ -8,7 +8,7 @@ license: apache-2.0
 - **Point of Contact:** [Adrián Tormos](mailto:[email protected])
 
 
-
+## Model Summary
 
 
 This is a fine-tuned Qwen2.5-7B-Instruct model on the [Egida-DPO-Qwen2.5-7B-Instruct](https://huggingface.co/datasets/HPAI-BSC/Egida/viewer/Egida-DPO-Qwen2.5-7B-Instruct) dataset.
@@ -17,7 +17,7 @@ Specifically, the [Egida](https://huggingface.co/datasets/HPAI-BSC/Egida/viewer/
 dataset for this model. This allows us to experiment with a DPO dataset composed by triplets < ”question”, ”chosen answer”, ”discarded answer” > which contain questions that elicit
 unsafe responses by this target model, as well as the unsafe responses produced by it.
 
-
+## Citation Information
 
 
 ```