Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
reasoning-project
community
Activity Feed
Follow
2
AI & ML interests
None defined yet.
Recent Activity
JW17
authored
a paper
14 days ago
AlphaPO -- Reward shape matters for LLM alignment
JW17
authored
a paper
14 days ago
Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning
JW17
authored
a paper
about 2 months ago
When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research
View all activity
Team members
3
models
4
Sort: Recently updated
reasoning-project/Q25M-1.5B-MR1-50k-SFT-v0.2-3epoch
Text Generation
•
2B
•
Updated
Feb 16
•
4
reasoning-project/Q25M-1.5B-Open-R1-55k-SFT-v0.1
Text Generation
•
2B
•
Updated
Feb 15
•
3
reasoning-project/Q25-1.5B-PRIME-55K-GRPO-Acc2-format5e1
Updated
Feb 14
reasoning-project/Q25-1.5B-Open-R1-55K-GRPO-Acc2-format5e1
Updated
Feb 14
datasets
0
None public yet