Hanbin Wang's picture

17 4 4

Hanbin Wang

hanbin

·

https://wanghanbinpanda.github.io/

wanghanbinpanda

AI & ML interests

Code Intelligence and LLM Reasoning (Code, Math)

Recent Activity

updated a model 4 days ago

PRIME-RL/Eurus-2-7B-PRIME

updated a model 4 days ago

PRIME-RL/Eurus-2-7B-SFT

updated a dataset 12 days ago

PRIME-RL/Eurus-2-RL-Data

View all activity

Articles

Process Reinforcement through Implicit Rewards

Organizations

hanbin's activity

upvoted an article 15 days ago

Article

Process Reinforcement through Implicit Rewards

By

•

15 days ago

• 16

upvoted a paper about 2 months ago

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 30

upvoted a collection 10 months ago

Eurus

Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Oct 22, 2024 • 24

upvoted a paper 10 months ago

Advancing LLM Reasoning Generalists with Preference Trees

Paper • 2404.02078 • Published Apr 2, 2024 • 44