MattBou00/kybukre0-rlhf-checkpoint-pythia-410m-irl
Safetensors
gpt_neo
Branch: main
Total size: 506 MB
1 contributor
History: 2 commits
Latest commit: f79c4e7 (verified) by MattBou00, 3 months ago: Upload final RLHF model - irl reward model
File                     Size       Flag  Last commit                                 Updated
.gitattributes           1.52 kB    Safe  initial commit                              3 months ago
README.md                1.04 kB    -     Upload final RLHF model - irl reward model  3 months ago
config.json              1.06 kB    -     Upload final RLHF model - irl reward model  3 months ago
generation_config.json   119 Bytes  Safe  Upload final RLHF model - irl reward model  3 months ago
merges.txt               456 kB     Safe  Upload final RLHF model - irl reward model  3 months ago
model.safetensors        501 MB     xet   Upload final RLHF model - irl reward model  3 months ago
special_tokens_map.json  470 Bytes  Safe  Upload final RLHF model - irl reward model  3 months ago
tokenizer.json           3.56 MB    Safe  Upload final RLHF model - irl reward model  3 months ago
tokenizer_config.json    555 Bytes  Safe  Upload final RLHF model - irl reward model  3 months ago
training_config.yaml     1.56 kB    -     Upload final RLHF model - irl reward model  3 months ago
vocab.json               798 kB     Safe  Upload final RLHF model - irl reward model  3 months ago