Stick To Your Role! Leaderboard 🎭 Benchmarking LLMs on the stability of simulated populations • Running • 43
SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling Paper • 2410.12481 • Published Oct 16, 2024
MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces Paper • 2502.07709 • Published Feb 11, 2025
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting Paper • 2410.19920 • Published Oct 25, 2024
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Paper • 2402.09844 • Published Feb 15, 2024 • 21
Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning Paper • 2302.02662 • Published Feb 6, 2023 • 1
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL Paper • 2103.09815 • Published Mar 17, 2021 • 2