zhu

xuekai

AI & ML interests

None yet

Organizations

TsinghuaC3I

xuekai's activity

upvoted an article 4 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By NormalUhr
152
upvoted 2 articles 5 months ago

Process Reinforcement through Implicit Rewards

By ganqu and 1 other
27
upvoted an article 5 months ago