arxiv:2512.24618
Xiaoyu Tan
WIlliam1900
AI & ML interests
None yet
Recent Activity
authored
a paper
10 days ago
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive
Exploration for Agentic Reinforcement Learning