reasoning-project

AI & ML interests

None defined yet.

Recent Activity

JW17  authored a paper 14 days ago
AlphaPO -- Reward shape matters for LLM alignment
JW17  authored a paper 14 days ago
Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning
JW17  authored a paper about 2 months ago
When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

GUIJIN SON, Hyunwoo Ko, Jiwoo Hong

models 4

reasoning-project/Q25M-1.5B-MR1-50k-SFT-v0.2-3epoch

Text Generation • 2B • Updated Feb 16 • 4

reasoning-project/Q25M-1.5B-Open-R1-55k-SFT-v0.1

Text Generation • 2B • Updated Feb 15 • 3

reasoning-project/Q25-1.5B-PRIME-55K-GRPO-Acc2-format5e1

Updated Feb 14

reasoning-project/Q25-1.5B-Open-R1-55K-GRPO-Acc2-format5e1

Updated Feb 14

datasets 0

None public yet