Spaces:
Running
Running
metadata
title: README
emoji: π
colorFrom: gray
colorTo: indigo
sdk: static
pinned: false
π OctoThinker is led by GAIR
π― Our Goal: To reshape the pre-training trajectory so models scale better under RL.