arxiv:2509.15207
zhu xuekai
AI & ML interests
None yet
Recent Activity
upvoted a paper 7 days ago
P1: Mastering Physics Olympiads with Reinforcement Learning
commented on a paper 28 days ago
FlowRL: Matching Reward Distributions for LLM Reasoning
updated a model 29 days ago
xuekai/FlowRL-DeepSeek-7B-code