Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
AdversarialRLHF
/
pythia410m-rm-tldr6.9b_prefix_in_chosen
like
0
Follow
Adversarial Goodhart RLHF
3
Text Classification
Transformers
Safetensors
gpt_neox
Generated from Trainer
trl
reward-trainer
text-generation-inference
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
pythia410m-rm-tldr6.9b_prefix_in_chosen
Commit History
Model save
7a59dc0
verified
Muqeeth
commited on
Apr 30
Training in progress, step 1823, checkpoint
e114a20
verified
Muqeeth
commited on
Apr 30
Training in progress, step 1823
af58bb1
verified
Muqeeth
commited on
Apr 30
Training in progress, step 1460, checkpoint
1954d7e
verified
Muqeeth
commited on
Apr 30
Training in progress, step 1460
3b3a0e8
verified
Muqeeth
commited on
Apr 30
Training in progress, step 1095, checkpoint
bd80e04
verified
Muqeeth
commited on
Apr 30
Training in progress, step 1095
b5a1def
verified
Muqeeth
commited on
Apr 30
Training in progress, step 730, checkpoint
77089bd
verified
Muqeeth
commited on
Apr 30
Training in progress, step 730
b02525c
verified
Muqeeth
commited on
Apr 30
Training in progress, step 365, checkpoint
0ff65bd
verified
Muqeeth
commited on
Apr 30
Training in progress, step 365
4334f6b
verified
Muqeeth
commited on
Apr 30
initial commit
fbe1b6f
verified
Muqeeth
commited on
Apr 29