Fan Zhou
koalazf99
AI & ML interests
Deep Learning; Natural Language Processing; Foundation Models
Recent Activity
upvoted
a
paper
30 minutes ago
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
authored
a paper
2 months ago
OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling
new activity
2 months ago
OctoThinker/MegaMath-Web-Pro-Max:[bot] Conversion to Parquet