siyeng feng
siyengfeng
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 11 hours ago
S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement
Learning
upvoted
a
paper
about 11 hours ago
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement
Learning
upvoted
a
paper
about 11 hours ago
S*: Test Time Scaling for Code Generation
Organizations
None yet
models
None public yet
datasets
None public yet