yi
zhongyi51
AI & ML interests
None yet
Recent Activity
commented on
a paper
5 days ago
Learning to Reason under Off-Policy Guidance
commented on
a paper
6 days ago
Does Reinforcement Learning Really Incentivize Reasoning Capacity in
LLMs Beyond the Base Model?
Organizations
None yet
models
0
None public yet
datasets
0
None public yet