SimpleRL-Zoo - a hkust-nlp Collection

hkust-nlp 's Collections

RL-Verifier-Pitfalls

Laser

M-STAR

CodeI/O

Deita

SimpleRL-Zoo

updated May 5

The collection for the Paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild"