Safetensors
English
qwen2
safety
danihinjos commited on
Commit
bbe3a39
Β·
verified Β·
1 Parent(s): af641e2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -0
README.md CHANGED
@@ -24,6 +24,22 @@ dataset for this model. This results in a DPO dataset composed by triplets < ”
24
  - **Batch Size:** 8
25
  - **LR:** 10βˆ’7
26
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
27
  ## Environmental Impact
28
 
29
 
 
24
  - **Batch Size:** 8
25
  - **LR:** 10βˆ’7
26
 
27
+ ## Performance
28
+
29
+ ### Safety Performance (Attack Success Ratio)
30
+
31
+ | | Egida (test) ↓ | DELPHI ↓ | Alert-Base ↓ | Alert-Adv ↓ |
32
+ |------------------------------|:--------------:|:--------:|:------------:|:-----------:|
33
+ | Qwen-2.5-7B-Instruct | 0.471 | 0.138 | 0.544 | 0.080 |
34
+ | Qwen-2.5-7B-Egida-DPO | 0.322 | 0.118 | 0.410 | 0.045 |
35
+
36
+ ### General Purpose Performance
37
+
38
+ | | OpenLLM Leaderboard (Average) ↑ | MMLU Generative (ROUGE1) ↑ |
39
+ |------------------------------|:---------------------:|:---------------:|
40
+ | Qwen-2.5-7B-Instruct | 0.488 | 0.331 |
41
+ | Qwen-2.5-7B-Egida-DPO | 0.488 | 0.296 |
42
+
43
  ## Environmental Impact
44
 
45