zhu

xuekai

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

P1: Mastering Physics Olympiads with Reinforcement Learning

commented on a paper 3 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

updated a model 3 months ago

xuekai/FlowRL-DeepSeek-7B-code

Papers 16

arxiv:2509.15207

arxiv:2509.09674

arxiv:2509.08827

arxiv:2509.04419

Models 3

xuekai/FlowRL-DeepSeek-7B-code

8B • Updated Oct 27, 2025 • 246

xuekai/FlowRL-Qwen2.5-32B-math

33B • Updated Oct 27, 2025

xuekai/FlowRL-Qwen2.5-7B-math

8B • Updated Oct 27, 2025 • 5

Datasets 2

xuekai/flowrl-data-collection

Preview • Updated Sep 28, 2025 • 131

xuekai/pad_train

Viewer • Updated Mar 21, 2024 • 184k • 11