jointpreferences/mistral_7b_sft_helpful
Tags: Text Generation, Transformers, Safetensors, mistral, text-generation-inference, Inference Endpoints, arxiv:2404.00530
hbXNov committed on Apr 2, 2024
Commit 80ca676 · verified · 1 Parent(s): 1f4caa5
Create README.md
Files changed (1)
README.md ADDED (+5 -0)
@@ -0,0 +1,5 @@
+Paper: Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization
+
+Link: https://arxiv.org/abs/2404.00530
+
+Github: https://github.com/Hritikbansal/dove