Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
knoveleng 's Collections
Mathematics Benchmark Datasets
Open-RS

Open-RS

updated Mar 21

Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t"

Upvote
12

  • Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

    Paper • 2503.16219 • Published Mar 20 • 48

  • knoveleng/OpenRS-GRPO

    Text Generation • Updated Mar 21 • 75 • 5

  • knoveleng/Open-RS1

    Text Generation • Updated Mar 24 • 620 • 4

  • knoveleng/Open-RS2

    Text Generation • Updated Mar 24 • 504 • 1

  • knoveleng/Open-RS3

    Text Generation • Updated Mar 24 • 2.92k • 19

  • knoveleng/open-rs

    Viewer • Updated Mar 24 • 7k • 1.32k • 10

  • knoveleng/open-s1

    Viewer • Updated Mar 21 • 18.6k • 346 • 4

  • knoveleng/open-deepscaler

    Viewer • Updated Mar 21 • 21k • 134 • 4
Upvote
12
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs