arxiv:2510.00915
Xin-Qiang Cai
caixq
AI & ML interests
RL, RLHF, Learning under Weak Supervision, Diffusion Model
Recent Activity
authored a paper 24 days ago
PIG-Nav: Key Insights for Pretrained Image Goal Navigation Models
authored a paper 24 days ago
Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers
upvoted a paper 26 days ago
Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers
Organizations
None yet