Triangle104 commited on
Commit
bc35173
·
verified ·
1 Parent(s): e3f9acb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -14
README.md CHANGED
@@ -26,24 +26,35 @@ Refer to the [original model card](https://huggingface.co/nbeerbower/Dumpling-Qw
26
 
27
  ---
28
  Dumpling-Qwen2.5-32B
29
-
30
  nbeerbower/EVA-abliterated-TIES-Qwen2.5-1.5B finetuned on:
31
 
32
- nbeerbower/GreatFirewall-DPO
33
- nbeerbower/Schule-DPO
34
- nbeerbower/Purpura-DPO
35
- nbeerbower/Arkhaios-DPO
36
- jondurbin/truthy-dpo-v0.1
37
- antiven0m/physical-reasoning-dpo
38
- flammenai/Date-DPO-NoAsterisks
39
- flammenai/Prude-Phi3-DPO
40
- Atsunori/HelpSteer2-DPO (1,000 samples)
41
- jondurbin/gutenberg-dpo-v0.1
42
- nbeerbower/gutenberg2-dpo
43
- nbeerbower/gutenberg-moderne-dpo.
44
 
45
- Method
 
 
 
 
 
 
 
 
 
 
46
 
 
 
 
 
 
 
 
 
 
 
 
 
47
  QLoRA ORPO tune with 2x RTX 3090 for 2 epochs.
48
 
49
  ---
 
26
 
27
  ---
28
  Dumpling-Qwen2.5-32B
29
+ -
30
  nbeerbower/EVA-abliterated-TIES-Qwen2.5-1.5B finetuned on:
31
 
32
+ -nbeerbower/GreatFirewall-DPO
 
 
 
 
 
 
 
 
 
 
 
33
 
34
+ -nbeerbower/Schule-DPO
35
+
36
+ -nbeerbower/Purpura-DPO
37
+
38
+ -nbeerbower/Arkhaios-DPO
39
+
40
+ -jondurbin/truthy-dpo-v0.1
41
+
42
+ -antiven0m/physical-reasoning-dpo
43
+
44
+ -flammenai/Date-DPO-NoAsterisks
45
 
46
+ -flammenai/Prude-Phi3-DPO
47
+
48
+ -Atsunori/HelpSteer2-DPO (1,000 samples)
49
+
50
+ -jondurbin/gutenberg-dpo-v0.1
51
+
52
+ -nbeerbower/gutenberg2-dpo
53
+
54
+ -nbeerbower/gutenberg-moderne-dpo.
55
+
56
+ Method
57
+ -
58
  QLoRA ORPO tune with 2x RTX 3090 for 2 epochs.
59
 
60
  ---