Update README.md
Browse files
README.md
CHANGED
|
@@ -32,6 +32,7 @@ tags:
|
|
| 32 |
- **Fine-tuned from model:** [Alpaca (reprod.)](https://huggingface.co/PKU-Alignment/alpaca-7b-reproduced) (reproduced version of [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca))
|
| 33 |
- **Dataset:** [PKU-SafeRLHF-30K](https://huggingface.co/datasets/PKU-Alignment/PKU-SafeRLHF-30K)
|
| 34 |
- **SACPO Paper:** <https://arxiv.org/abs/2404.11049>
|
|
|
|
| 35 |
- **Model Alias:** P-SACPO 0.75
|
| 36 |
|
| 37 |
## Usage: How to Talk with the Model
|
|
|
|
| 32 |
- **Fine-tuned from model:** [Alpaca (reprod.)](https://huggingface.co/PKU-Alignment/alpaca-7b-reproduced) (reproduced version of [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca))
|
| 33 |
- **Dataset:** [PKU-SafeRLHF-30K](https://huggingface.co/datasets/PKU-Alignment/PKU-SafeRLHF-30K)
|
| 34 |
- **SACPO Paper:** <https://arxiv.org/abs/2404.11049>
|
| 35 |
+
- **GitHub:** <https://github.com/line/sacpo>
|
| 36 |
- **Model Alias:** P-SACPO 0.75
|
| 37 |
|
| 38 |
## Usage: How to Talk with the Model
|