Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Fan Zhou's picture

Fan Zhou

koalazf99

Fishtiks's profile picture

Zhihui's profile picture

Vfrz's profile picture

·

https://koalazf99.github.io/

FaZhou_998
koalazf99

AI & ML interests

Deep Learning; Natural Language Processing; Foundation Models

Organizations

koalazf99 's collections 4

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49
LLM360/guru-RL-92k

Viewer • Updated 19 days ago • 91.9k • 1.05k • 19
LLM360/guru-7B

Text Generation • 8B • Updated Jun 19 • 1.18k • • 1
LLM360/guru-32B

Text Generation • 33B • Updated Jun 19 • 18

An Open Math Pre-trainng Dataset with 370B Tokens.

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3 • 34
LLM360/MegaMath

Viewer • Updated Apr 9 • 217M • 26.5k • 98
LLM360/MegaMath-Llama-3.2-3B

Text Generation • 3B • Updated Apr 16 • 10 • 5
LLM360/MegaMath-Llama-3.2-1B

Text Generation • 1B • Updated Apr 16 • 8 • 1

🐙 OctoThinker

Mid-training Incentivizes Reinforcement Learning Scaling

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25 • 46
OctoThinker/MegaMath-Web-Pro-Max

Viewer • Updated Jul 6 • 69.2M • 8.64k • 35
OctoThinker/OctoThinker-8B-Long-Base

Text Generation • 8B • Updated Jul 6 • 15
OctoThinker/OctoThinker-8B-Hybrid-Base

Text Generation • 8B • Updated Jul 6 • 143 • 2

🫐 ProX Projects

Collection for: "Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale"

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 64
gair-prox/DCLM-pro

Viewer • Updated Feb 15 • 366M • 2.6k • 11
gair-prox/FineWeb-pro

Viewer • Updated Sep 26, 2024 • 63.1M • 965 • 26
gair-prox/open-web-math-pro

Viewer • Updated Sep 26, 2024 • 2.58M • 533 • 12

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49
LLM360/guru-RL-92k

Viewer • Updated 19 days ago • 91.9k • 1.05k • 19
LLM360/guru-7B

Text Generation • 8B • Updated Jun 19 • 1.18k • • 1
LLM360/guru-32B

Text Generation • 33B • Updated Jun 19 • 18

🐙 OctoThinker

Mid-training Incentivizes Reinforcement Learning Scaling

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25 • 46
OctoThinker/MegaMath-Web-Pro-Max

Viewer • Updated Jul 6 • 69.2M • 8.64k • 35
OctoThinker/OctoThinker-8B-Long-Base

Text Generation • 8B • Updated Jul 6 • 15
OctoThinker/OctoThinker-8B-Hybrid-Base

Text Generation • 8B • Updated Jul 6 • 143 • 2

An Open Math Pre-trainng Dataset with 370B Tokens.

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3 • 34
LLM360/MegaMath

Viewer • Updated Apr 9 • 217M • 26.5k • 98
LLM360/MegaMath-Llama-3.2-3B

Text Generation • 3B • Updated Apr 16 • 10 • 5
LLM360/MegaMath-Llama-3.2-1B

Text Generation • 1B • Updated Apr 16 • 8 • 1

🫐 ProX Projects

Collection for: "Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale"

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 64
gair-prox/DCLM-pro

Viewer • Updated Feb 15 • 366M • 2.6k • 11
gair-prox/FineWeb-pro

Viewer • Updated Sep 26, 2024 • 63.1M • 965 • 26
gair-prox/open-web-math-pro

Viewer • Updated Sep 26, 2024 • 2.58M • 533 • 12

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs