cais
/

Yi-34B-Chat_RMU

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

alicegatti commited on Apr 23, 2024

Commit

d7aada5

·

verified ·

1 Parent(s): b316047

changed model name

Files changed (1) hide show

README.md +6 -6

README.md CHANGED Viewed

@@ -12,7 +12,7 @@ library_name: transformers
 ---
-# Yi RMU
 Yi 34B Chat model with hazardous knowledge about biosecurity and cybersecurity "unlearned" using Representation Misdirection for Unlearning (RMU). For more details, please check [our paper](https://arxiv.org/abs/2403.03218).
@@ -24,12 +24,12 @@ Yi 34B Chat model with hazardous knowledge about biosecurity and cybersecurity "
 ## Performance
-Yi RMU has been evaluated on [WMDP](https://huggingface.co/datasets/cais/wmdp), [MMLU](https://huggingface.co/datasets/cais/mmlu) and [MT-Bench](https://huggingface.co/spaces/lmsys/mt-bench). Higher accuracy on MMLU and MT-Bench, and lower accuracy on WMDP are preferred.
-|             |  WMDP-Bio | WMDP-Cyber |  MMLU  | MT-Bench |
-|-------------|:---------:|:----------:|:------:|:--------:|
-| Yi-34B Chat |    75.3   |    49.7    |  72.6  |   7.65   |
-|   Yi RMU    |    30.7   |    29.0    |  70.6  |   7.59   |

 ---
+# Yi 34B Chat RMU
 Yi 34B Chat model with hazardous knowledge about biosecurity and cybersecurity "unlearned" using Representation Misdirection for Unlearning (RMU). For more details, please check [our paper](https://arxiv.org/abs/2403.03218).
 ## Performance
+Yi 34B Chat RMU has been evaluated on [WMDP](https://huggingface.co/datasets/cais/wmdp), [MMLU](https://huggingface.co/datasets/cais/mmlu) and [MT-Bench](https://huggingface.co/spaces/lmsys/mt-bench). Higher accuracy on MMLU and MT-Bench, and lower accuracy on WMDP are preferred.
+|                 |  WMDP-Bio | WMDP-Cyber |  MMLU  | MT-Bench |
+|-----------------|:---------:|:----------:|:------:|:--------:|
+| Yi 34B Chat     |    75.3   |    49.7    |  72.6  |   7.65   |
+| Yi 34B Chat RMU |    30.7   |    29.0    |  70.6  |   7.59   |