ppo-LunarLander-v2 / results.json
HugeFighter's picture
LunarLander-v2 agent trained using PPO algorithm
68e9220 verified
raw
history blame
158 Bytes
{"mean_reward": 253.7840121, "std_reward": 16.600022850579595, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2025-01-01T09:44:27.064531"}