This is a pretrained model used in PPO toy example from CarperAI/trlX
- Downloads last month
- 21
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support
This is a pretrained model used in PPO toy example from CarperAI/trlX