AI & ML interests
None defined yet.
Recent Activity
View all activity
Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t"
-
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't
Paper • 2503.16219 • Published • 51 -
knoveleng/OpenRS-GRPO
Text Generation • 2B • Updated • 89 • 5 -
knoveleng/Open-RS1
Text Generation • 2B • Updated • 1.91k • 4 -
knoveleng/Open-RS2
Text Generation • 2B • Updated • 1.87k • 1
Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t"
-
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't
Paper • 2503.16219 • Published • 51 -
knoveleng/OpenRS-GRPO
Text Generation • 2B • Updated • 89 • 5 -
knoveleng/Open-RS1
Text Generation • 2B • Updated • 1.91k • 4 -
knoveleng/Open-RS2
Text Generation • 2B • Updated • 1.87k • 1
datasets
7
knoveleng/redbench
Viewer
•
Updated
•
29.4k
•
140
knoveleng/open-rs
Viewer
•
Updated
•
7k
•
1.54k
•
11
knoveleng/open-deepscaler
Viewer
•
Updated
•
21k
•
85
•
4
knoveleng/open-s1
Viewer
•
Updated
•
18.6k
•
149
•
4
knoveleng/AMC-23
Viewer
•
Updated
•
40
•
3.62k
knoveleng/OlympiadBench
Viewer
•
Updated
•
675
•
1.91k
•
1
knoveleng/Minerva-Math
Viewer
•
Updated
•
272
•
2.25k
•
1