Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
princeton-nlp
/
gemma-2-9b-it-DPO
like
9
Text Generation
Transformers
Safetensors
princeton-nlp/gemma2-ultrafeedback-armorm
gemma2
alignment-handbook
Generated from Trainer
conversational
text-generation-inference
arxiv:
2405.14734
arxiv:
2310.01377
arxiv:
2406.12845
Model card
Files
Files and versions
Community
2
Train
Deploy
Use this model
1a75e57
gemma-2-9b-it-DPO
Ctrl+K
Ctrl+K
2 contributors
History:
2 commits
princeton-nlp
Upload Gemma2ForCausalLM
1a75e57
verified
10 months ago
.gitattributes
Safe
1.52 kB
initial commit
10 months ago
README.md
Safe
5.17 kB
Upload Gemma2ForCausalLM
10 months ago
config.json
950 Bytes
Upload Gemma2ForCausalLM
10 months ago
generation_config.json
Safe
168 Bytes
Upload Gemma2ForCausalLM
10 months ago
model-00001-of-00004.safetensors
4.9 GB
LFS
Upload Gemma2ForCausalLM
10 months ago
model-00002-of-00004.safetensors
4.95 GB
LFS
Upload Gemma2ForCausalLM
10 months ago
model-00003-of-00004.safetensors
4.96 GB
LFS
Upload Gemma2ForCausalLM
10 months ago
model-00004-of-00004.safetensors
3.67 GB
LFS
Upload Gemma2ForCausalLM
10 months ago
model.safetensors.index.json
Safe
39.1 kB
Upload Gemma2ForCausalLM
10 months ago