Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
AlignmentResearch
/
pineapple-oskar_005d_rm_training
like
0
Follow
FAR AI
26
PEFT
Safetensors
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Use this model
main
pineapple-oskar_005d_rm_training
Commit History
Upload trained reward model
71240ac
verified
skar0
commited on
18 days ago
Upload trained reward model
e5c32c2
verified
skar0
commited on
18 days ago
Upload trained reward model
f710838
verified
skar0
commited on
24 days ago
initial commit
91421c2
verified
skar0
commited on
24 days ago