Teaching language models to think efficiently with Adaptive Length Penalty (ALP)
AI & ML interests
Scaling up good synthetic reasoning. Post-training and synthetic data research lab.
Organization Card
This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers.
- SynthLabsAI/Big-Math-RL-Verified • 251k • 5.6k • 188
- SynthLabsAI/Big-Math-RL-UNVERIFIED • 34.9k • 21 • 1
- nlile/NuminaMath-1.5-RL-Verifiable • 131k • 6.67k • 5
- Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models (Paper • 2502.17387 • 6)
Datasets (6)
- SynthLabsAI/Big-Math-RL-Verified • 251k • 5.6k • 188
- SynthLabsAI/Big-Math-RL-UNVERIFIED • 34.9k • 21 • 1
- SynthLabsAI/PERSONA • 200k • 3.59k • 16
- SynthLabsAI/PERSONA_subset • 5k • 3.5k • 1
- SynthLabsAI/PRISM-Filter • 3.87k • 11
- SynthLabsAI/Synthetic-Personas • 1k • 14 • 1