wzhouad/Llama3-Instruct-8B-WPO-HB-v2
Text Generation
Transformers
Safetensors
wzhouad/llama3-ultrafeedback-hybrid-v2
llama
alignment-handbook
conversational
text-generation-inference
Inference Endpoints
arxiv:2406.11827
main
Llama3-Instruct-8B-WPO-HB-v2/model-00005-of-00007.safetensors
Commit History
Upload LlamaForCausalLM
cf99201
verified
wzhouad committed on Jul 24