Update README.md
README.md
CHANGED
@@ -19,6 +19,8 @@ Upscaled version of [Llama-3-Instruct-8B-SPPO-Iter3](https://huggingface.co/UCLA
 
 Self-Play Preference Optimization for Language Model Alignment (https://arxiv.org/abs/2405.00675)
 
+---------------------------------------------------------------------
+
 ## merge
 
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).