Hanbin Wang

hanbin

AI & ML interests

Code Intelligence and LLM Reasoning (Code, Math)

Recent Activity

updated a model 4 days ago
PRIME-RL/Eurus-2-7B-PRIME
updated a model 4 days ago
PRIME-RL/Eurus-2-7B-SFT
updated a dataset 12 days ago
PRIME-RL/Eurus-2-RL-Data
View all activity

Articles

Organizations

OpenBMB's profile picture PRIME's profile picture

hanbin's activity

upvoted an article 15 days ago
view article
Article

Process Reinforcement through Implicit Rewards

By ganqu
16