CartPole-v1_ppo
Run: cvb5lyfw
โ Env: CartPole-v1
โ Algo: ppo
This repository contains artifacts from a Gymnasium Solver training run.
Contents
- Config:
artifacts/configs/config.json
- Checkpoints:
artifacts/checkpoints/*.ckpt
- Logs:
artifacts/logs/*.log
- Video:
artifacts/videos/**/best_checkpoint.mp4
(also previewed below)
Preview
If the video above doesn't load, try the fallback: replay.mp4
Config (excerpt)
{
"env_id": "CartPole-v1",
"algo_id": "ppo",
"n_steps": 32,
"batch_size": 256,
"n_epochs": 20,
"n_timesteps": 100000.0,
"seed": 42,
"n_envs": 8,
"obs_type": "rgb",
"policy": "MlpPolicy",
"learning_rate": 0.001,
"gamma": 0.98,
"gae_lambda": 0.8,
"ent_coef": 0.0,
"vf_coef": 0.5,
"clip_range": 0.2,
"normalize_advantages": "batch"
}
Evaluation results
- Best Eval Reward on CartPole-v1self-reported272.400
- Current Eval Reward on CartPole-v1self-reported500.000
- Epoch on CartPole-v1self-reported199.000
- Total Timesteps on CartPole-v1self-reported0.000