Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Wayne
back-prop
Follow
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
updated
a model
12 days ago
sunblaze-ucb/Qwen2.5-1.5B-GRPO-MATH-1EPOCH
updated
a model
12 days ago
sunblaze-ucb/Qwen2.5-3B-GRPO-MATH-1EPOCH
published
a model
12 days ago
sunblaze-ucb/Qwen2.5-1.5B-GRPO-MATH-1EPOCH
View all activity
Organizations
back-prop
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
upvoted
a
paper
18 days ago
Learning to Reason without External Rewards
Paper
•
2505.19590
•
Published
19 days ago
•
27