Shangziqi Zhao
zhaoshangziqi
AI & ML interests
None yet
Recent Activity
upvoted a paper 7 days ago
From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones
upvoted a paper 18 days ago
FlowRL: Matching Reward Distributions for LLM Reasoning
upvoted a paper 19 days ago
Towards a Unified View of Large Language Model Post-Training
Organizations
None yet