Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
Fan Zhou
koalazf99
AI & ML interests
Deep Learning; Natural Language Processing; Foundation Models
Recent Activity
liked
a dataset
about 5 hours ago
OctoThinker/MegaMath-Web-Pro-Max
updated
a collection
about 7 hours ago
🐙 OctoThinker
upvoted
a
paper
about 8 hours ago
OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling