The collection for the Paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"
Xinyu Zhu
TianHongZXY
AI & ML interests
Large Language Models; Reasoning; Reinforcement Learning
Recent Activity
liked
a dataset
3 days ago
nvidia/Nemotron-Post-Training-Dataset-v1
upvoted
a
collection
14 days ago
RLVR-Decomposed
updated
a model
14 days ago
TianHongZXY/Qwen2.5-Math-7B-GRPO