wzhouad/Llama3-Instruct-8B-WPO-HB-v2
Text Generation
Transformers
Safetensors
wzhouad/llama3-ultrafeedback-hybrid-v2
llama
alignment-handbook
conversational
text-generation-inference
arxiv:2406.11827
arxiv:2310.01377
Commit History (README.md)
Update README.md (b4adefb, verified): wzhouad committed on Jul 31, 2024
Update README.md (e7c6e9c, verified): wzhouad committed on Jul 31, 2024
Update README.md (3a3f99d, verified): wzhouad committed on Jul 31, 2024
Update README.md (afab71f, verified): wzhouad committed on Jul 24, 2024
Upload LlamaForCausalLM (cf99201, verified): wzhouad committed on Jul 24, 2024