vwxyzjn/train_sft_accelerate_summarize__tldr__seed1304__1697169024 Text Generation • 0.1B • Updated Oct 13, 2023 • 7
vwxyzjn/train_sft_accelerate_summarize__tldr__seed1301__1697169023 Text Generation • 0.1B • Updated Oct 13, 2023 • 13
vwxyzjn/train_sft_accelerate_summarize__tldr__seed1302__1697169015 Text Generation • 0.1B • Updated Oct 13, 2023 • 11
vwxyzjn/train_sft_accelerate_summarize__tldr__seed1403__1697168471 Text Generation • 0.0B • Updated Oct 13, 2023 • 9
vwxyzjn/train_sft_accelerate_summarize__tldr__seed1404__1697168449 Text Generation • 0.0B • Updated Oct 13, 2023 • 7
vwxyzjn/train_sft_accelerate_summarize__tldr__seed1402__1697168345 Text Generation • 0.0B • Updated Oct 13, 2023 • 7
vwxyzjn/train_sft_accelerate_summarize__tldr__seed1401__1697168334 Text Generation • 0.0B • Updated Oct 13, 2023 • 8
vwxyzjn/train_sft_accelerate_summarize__tldr__seed1400__1697168329 Text Generation • 0.0B • Updated Oct 13, 2023 • 7
vwxyzjn/train_policy_accelerate__sentiment_offline_5k.json__seed1__1696447674 Text Generation • 0.1B • Updated Oct 4, 2023 • 11
vwxyzjn/Breakout-v5-cleanba_impala_envpool_machado_atari_wrapper-seed1 Reinforcement Learning • Updated Mar 25, 2023
vwxyzjn/Breakout-v5-cleanba_ppo_envpool_impala_atari_wrapper-seed1 Reinforcement Learning • Updated Mar 2, 2023
vwxyzjn/BigfishHard-v0-cleanba_ppo_envpool_procgen-seed1 Reinforcement Learning • Updated Feb 27, 2023
vwxyzjn/StarpilotHard-v0-cleanba_ppo_envpool_procgen-seed1 Reinforcement Learning • Updated Feb 27, 2023
vwxyzjn/Breakout-v5-cleanba_ppo_envpool_impala_atari_wrapper_naturecnn-seed1 Reinforcement Learning • Updated Feb 22, 2023
vwxyzjn/ChaserHard-v0-cleanba_ppo_envpool_procgen-seed1 Reinforcement Learning • Updated Feb 22, 2023
vwxyzjn/Breakout-v5-cleanba_ppo_envpool_impala_atari_wrapper_large-seed1 Reinforcement Learning • Updated Feb 19, 2023
vwxyzjn/Breakout-v5-ppo_atari_envpool_xla_jax_scan-seed3 Reinforcement Learning • Updated Jan 1, 2023
vwxyzjn/Breakout-v5-ppo_atari_envpool_xla_jax_scan-seed2 Reinforcement Learning • Updated Jan 1, 2023
vwxyzjn/Breakout-v5-ppo_atari_envpool_xla_jax_scan-seed1 Reinforcement Learning • Updated Jan 1, 2023
vwxyzjn/Breakout-v5-ppo_atari_envpool_async_jax_scan_impalanet_machado-seed1 Reinforcement Learning • Updated Jan 1, 2023