Wayne

back-prop

AI & ML interests

None yet

Recent Activity

updated a model 12 days ago

sunblaze-ucb/Qwen2.5-1.5B-GRPO-MATH-1EPOCH

updated a model 12 days ago

sunblaze-ucb/Qwen2.5-3B-GRPO-MATH-1EPOCH

published a model 12 days ago

sunblaze-ucb/Qwen2.5-1.5B-GRPO-MATH-1EPOCH

Organizations

back-prop's activity

upvoted a paper 18 days ago

Learning to Reason without External Rewards

Paper • 2505.19590 • Published 19 days ago • 27