v000000
/

Llama-3-Instruct-15B-SPPO-Iter3-SH

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

v000000 commited on Jul 13, 2024

Commit

fa2c6d6

·

verified ·

1 Parent(s): 829e357

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -19,6 +19,8 @@ Upscaled version of [Llama-3-Instruct-8B-SPPO-Iter3](https://huggingface.co/UCLA
 Self-Play Preference Optimization for Language Model Alignment (https://arxiv.org/abs/2405.00675)
 ## merge
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

 Self-Play Preference Optimization for Language Model Alignment (https://arxiv.org/abs/2405.00675)
+---------------------------------------------------------------------
 ## merge
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).