Yang Su's picture

3 4 17

Yang Su

yang-su2000

·

https://alicellm.github.io/

AI & ML interests

Long-Horizon RL Agent Alignment

Recent Activity

liked a dataset 22 days ago

openai/gdpval

liked a dataset 3 months ago

Agent-Ark/Toucan-1.5M

new activity 8 months ago

Qwen/Qwen3-32B:The correct way of fine-tuning on multi-turn trajectories

View all activity

Organizations

upvoted a paper 10 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 113

upvoted 2 papers about 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 84

upvoted a collection almost 2 years ago

Gemma release

Groups the Gemma models released by the Google team. • 40 items • Updated Jul 10 • 347