learning_rate: 5e-5
num_epochs: 1
total_batch_size: 16
- Downloads last month
- 5
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for lblaoke/opt-125m-hh-rlhf-rm-trl-v5
Base model
facebook/opt-125m