Teaching language models to think efficiently with Adaptive Length Penalty (ALP)
AI & ML interests
Scaling up good synthetic reasoning. Post-training and synthetic data research lab.
Recent Activity
View all activity
Organization Card
This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers
-
SynthLabsAI/Big-Math-RL-Verified
Viewer • Updated • 251k • 8.29k • 194 -
SynthLabsAI/Big-Math-RL-UNVERIFIED
Viewer • Updated • 34.9k • 6 • 1 -
nlile/NuminaMath-1.5-RL-Verifiable
Viewer • Updated • 131k • 6.27k • 6 -
Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
Paper • 2502.17387 • Published • 6
Teaching language models to think efficiently with Adaptive Length Penalty (ALP)
This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers
-
SynthLabsAI/Big-Math-RL-Verified
Viewer • Updated • 251k • 8.29k • 194 -
SynthLabsAI/Big-Math-RL-UNVERIFIED
Viewer • Updated • 34.9k • 6 • 1 -
nlile/NuminaMath-1.5-RL-Verifiable
Viewer • Updated • 131k • 6.27k • 6 -
Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
Paper • 2502.17387 • Published • 6
datasets
6
SynthLabsAI/Big-Math-RL-Verified
Viewer
•
Updated
•
251k
•
8.29k
•
194
SynthLabsAI/Big-Math-RL-UNVERIFIED
Viewer
•
Updated
•
34.9k
•
6
•
1
SynthLabsAI/PERSONA
Viewer
•
Updated
•
200k
•
3.5k
•
18
SynthLabsAI/PERSONA_subset
Viewer
•
Updated
•
5k
•
3.44k
•
2
SynthLabsAI/PRISM-Filter
Viewer
•
Updated
•
3.87k
•
1
SynthLabsAI/Synthetic-Personas
Viewer
•
Updated
•
1k
•
5
•
1