v000000
/

Llama-3-Instruct-15B-SPPO-Iter3-SH

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

v000000 commited on Jul 13, 2024

Commit

77dad4a

·

verified ·

1 Parent(s): bc4f4b1

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ Semi-Healed Llama-3 15B Frankenmerge
 # Llama-3-Instruct-15B-SPPO-Iter3-SH
-Upscaled version of [Llama-3-Instruct-8B-SPPO-Iter3](https://huggingface.co/UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3) to 15B parameters with projection swap.
 Self-Play Preference Optimization for Language Model Alignment (https://arxiv.org/abs/2405.00675)

 # Llama-3-Instruct-15B-SPPO-Iter3-SH
+Fully functional upscaled version of [Llama-3-Instruct-8B-SPPO-Iter3](https://huggingface.co/UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3) to 15B parameters with projection swap.
 Self-Play Preference Optimization for Language Model Alignment (https://arxiv.org/abs/2405.00675)