S2R
S2R-data
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
12 days ago
SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning
upvoted
a
paper
3 months ago
S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement
Learning