ji
gongwu
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 20 hours ago
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes
Correct Reasoning in Base LLMs
upvoted
a
paper
5 months ago
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep
Thinking
Organizations
None yet
models
0
None public yet
datasets
0
None public yet