MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published Jun 16 • 254
Gromov series [GRPO] Collection Specific datasets particularly effective in GRPO • 6 items • Updated May 28 • 1