ZHOU's picture

6 1

ZHOU

TOBI-X

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

upvoted a paper 6 days ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

upvoted a paper 7 days ago

reWordBench: Benchmarking and Improving the Robustness of Reward Models with Transformed Inputs

View all activity

Organizations

None yet

TOBI-X's activity

upvoted a paper 4 days ago

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Paper • 2503.16419 • Published 4 days ago • 57

upvoted a paper 6 days ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published 6 days ago • 100

upvoted a paper 7 days ago

reWordBench: Benchmarking and Improving the Robustness of Reward Models with Transformed Inputs

Paper • 2503.11751 • Published 10 days ago • 15

upvoted a collection 12 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 17 items • Updated 4 days ago • 111

liked a dataset 16 days ago

amphora/MCLM

Viewer • Updated 20 days ago • 156 • 559 • 1

upvoted a paper about 1 month ago

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Paper • 2502.07346 • Published Feb 11 • 51

upvoted a collection 4 months ago

MoEs papers reading list

60 items • Updated Nov 4, 2024 • 141