XumengWen
AI & ML interests
None yet
Recent Activity
authored a paper 16 days ago
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs
liked a Space about 2 months ago
Qwen/Qwen2.5