The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper β’ 2506.05209 β’ Published 3 days ago β’ 28
Learning Adaptive Parallel Reasoning with Language Models Paper β’ 2504.15466 β’ Published Apr 21 β’ 42
Big-Math Collection This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers β’ 4 items β’ Updated Apr 16 β’ 5
Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models Paper β’ 2502.17387 β’ Published Feb 24 β’ 6
Big-Math Collection This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers β’ 4 items β’ Updated Apr 16 β’ 5
Big-Math Collection This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers β’ 4 items β’ Updated Apr 16 β’ 5