Hugging Face
Wayne
back-prop
0 followers · 1 following
AI & ML interests
None yet
Recent Activity
updated a model 12 days ago
sunblaze-ucb/Qwen2.5-1.5B-GRPO-MATH-1EPOCH
updated a model 12 days ago
sunblaze-ucb/Qwen2.5-3B-GRPO-MATH-1EPOCH
published a model 12 days ago
sunblaze-ucb/Qwen2.5-1.5B-GRPO-MATH-1EPOCH
Organizations
back-prop's activity
updated 2 models 12 days ago
sunblaze-ucb/Qwen2.5-1.5B-GRPO-MATH-1EPOCH • Text Generation • Updated 12 days ago • 27
sunblaze-ucb/Qwen2.5-3B-GRPO-MATH-1EPOCH • Text Generation • Updated 12 days ago • 24
published 2 models 12 days ago
sunblaze-ucb/Qwen2.5-1.5B-GRPO-MATH-1EPOCH • Text Generation • Updated 12 days ago • 27
sunblaze-ucb/Qwen2.5-3B-GRPO-MATH-1EPOCH • Text Generation • Updated 12 days ago • 24
updated 2 models 12 days ago
sunblaze-ucb/Qwen2.5-3B-Intuitor-MATH-1EPOCH • Text Generation • Updated 12 days ago • 1.2k
back-prop/Qwen2.5-SPO-3B-clean • Text Generation • Updated 12 days ago • 24
published a model 12 days ago
sunblaze-ucb/Qwen2.5-3B-Intuitor-MATH-1EPOCH • Text Generation • Updated 12 days ago • 1.2k
updated 2 models 12 days ago
back-prop/Qwen2.5-SPO-1.5B • Text Generation • Updated 12 days ago • 46
sunblaze-ucb/Qwen2.5-1.5B-Intuitor-MATH-1EPOCH • Text Generation • Updated 12 days ago • 87
published a model 12 days ago
sunblaze-ucb/Qwen2.5-1.5B-Intuitor-MATH-1EPOCH • Text Generation • Updated 12 days ago • 87
updated 2 models 12 days ago
back-prop/Qwen2.5-GRPO-1.5B • Text Generation • Updated 12 days ago • 14
back-prop/Qwen2.5-GRPO-3B • Text Generation • Updated 12 days ago • 15
upvoted a paper 18 days ago
Learning to Reason without External Rewards • Paper • 2505.19590 • Published 19 days ago • 27
published 3 models 19 days ago
back-prop/Qwen2.5-SPO-3B-clean • Text Generation • Updated 12 days ago • 24
back-prop/Qwen2.5-GRPO-1.5B • Text Generation • Updated 12 days ago • 14
back-prop/Qwen2.5-SPO-1.5B • Text Generation • Updated 12 days ago • 46
published a model 24 days ago
back-prop/Qwen2.5-GRPO-3B • Text Generation • Updated 12 days ago • 15
updated a model 24 days ago
back-prop/Qwen2.5-SPO-3B • Updated 24 days ago • 9
published a model 24 days ago
back-prop/Qwen2.5-SPO-3B • Updated 24 days ago • 9