RL - a Ksgk-fy Collection

updated Jul 1

Accelerating Exploration with Unlabeled Prior Data

Paper • 2311.05067 • Published Nov 9, 2023 • 1

Efficient Online Reinforcement Learning with Offline Data

Paper • 2302.02948 • Published Feb 6, 2023 • 2

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Paper • 2407.04620 • Published Jul 5, 2024 • 35

Note: A subset of the parameters is learnable during inference, trained against a self-supervised (SSL) target. Great idea.

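A minimal sketch of the idea in the note, under simplifying assumptions: the hidden state is a weight matrix `W` that takes one gradient step per token on a self-supervised reconstruction loss, while the projections stay frozen at inference. The names `ttt_step`, `ttt_layer`, `theta_k`, `theta_v`, and `theta_q` are hypothetical, and the plain squared-error loss with a fixed learning rate is a simplification, not the authors' implementation.

```python
def matvec(M, v):
    """Multiply matrix M (list of rows) by vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def ttt_step(W, k, v, lr):
    """One gradient step on ||W k - v||^2 with respect to W only.

    Returns the updated W and the loss measured before the update.
    """
    err = [p - t for p, t in zip(matvec(W, k), v)]
    loss = sum(e * e for e in err)
    # Gradient of the squared error w.r.t. W is 2 * outer(err, k).
    W_new = [[w - lr * 2.0 * e * kj for w, kj in zip(row, k)]
             for row, e in zip(W, err)]
    return W_new, loss


def ttt_layer(tokens, theta_k, theta_v, theta_q, lr=0.1):
    """Process a sequence while learning W at inference time.

    theta_k, theta_v, theta_q are frozen projections (assumed here);
    only W, the hidden state, is updated during inference.
    """
    W = [[0.0] * len(theta_k) for _ in range(len(theta_v))]
    outputs, losses = [], []
    for x in tokens:
        k = matvec(theta_k, x)  # input view for the self-supervised task
        v = matvec(theta_v, x)  # reconstruction target
        q = matvec(theta_q, x)  # view used to produce the output
        W, loss = ttt_step(W, k, v, lr)
        outputs.append(matvec(W, q))
        losses.append(loss)
    return outputs, losses, W


if __name__ == "__main__":
    identity = [[1.0, 0.0], [0.0, 1.0]]
    tokens = [[1.0, 0.0]] * 5
    outputs, losses, W = ttt_layer(tokens, identity, identity, identity)
    print(losses)
```

Feeding the same token repeatedly makes the per-step loss shrink, which is the sense in which the hidden state "learns" during inference.
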
Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 255

Note: Still post-training.

SkillBlender: Towards Versatile Humanoid Whole-Body Loco-Manipulation via Skill Blending

Paper • 2506.09366 • Published Jun 11 • 8

General agents need world models

Paper • 2506.01622 • Published Jun 2 • 1

Note: Basically wrong. A Markov Decision Process requires decision making that is invariant to history, so considering a temporally dependent goal that is not encoded in the current state itself is a fallacy.
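
For reference, the Markov property this note appeals to can be written as below. The symbols $s_t$, $a_t$, and $\pi$ are standard notation assumed here, not taken from the paper.

```latex
% Transitions depend on the history only through the current state and action:
P(s_{t+1} \mid s_t, a_t) = P(s_{t+1} \mid s_0, a_0, \ldots, s_t, a_t)

% A policy in this setting can therefore condition on the current state alone:
\pi(a_t \mid s_t)
```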