hkust-nlp/SimpleRL-Zoo-Data
Viewer
•
Updated
•
53.1k
•
528
•
6
The collection for the Paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild"