Rui Hu
Raynhu
ยท
AI & ML interests
LLM Post-Training & Agentic RL & End2End Agent
Recent Activity
upvoted
a
paper
3 days ago
Self-Reflective Generation at Test Time
upvoted
a
paper
5 months ago
When to Continue Thinking: Adaptive Thinking Mode Switching for
Efficient Reasoning
updated
a model
5 months ago
Raynhu/Qwen2.5-Omni-7B-SFT
Organizations
None yet