6 22 1

Wenkai Yang

Keven16

https://keven980716.github.io/

keven980716

AI & ML interests

None yet

Recent Activity

authored a paper 20 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

upvoted a paper 22 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

authored a paper about 2 months ago

AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents

View all activity

Organizations

None yet

authored a paper 20 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 23 days ago • 90

upvoted a paper 22 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 23 days ago • 90

authored a paper about 2 months ago

AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents

Paper • 2603.14465 • Published Mar 15 • 23

updated a dataset about 2 months ago

Keven16/OPSD-Example-Data

Viewer • Updated Mar 18 • 49.1k • 84

published a dataset about 2 months ago

Keven16/OPSD-Example-Data

Viewer • Updated Mar 18 • 49.1k • 84

upvoted a paper about 2 months ago

AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents

Paper • 2603.14465 • Published Mar 15 • 23

updated 2 models about 2 months ago

Keven16/Qwen3-4B-Non-Thinking-RL-Code-Step300

4B • Updated Mar 16 • 22

Keven16/Qwen3-4B-Non-Thinking-RL-Math-Step500

4B • Updated Mar 16 • 2.96k

published 2 models about 2 months ago

Keven16/Qwen3-4B-Non-Thinking-RL-Code-Step300

4B • Updated Mar 16 • 22

Keven16/Qwen3-4B-Non-Thinking-RL-Math-Step500

4B • Updated Mar 16 • 2.96k

liked a dataset about 2 months ago

LulaCola/AgentProcessBench

Viewer • Updated Mar 18 • 1k • 230 • 14

authored 2 papers 3 months ago

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

Paper • 2602.12125 • Published Feb 12 • 64

Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning

Paper • 2506.07851 • Published Jun 9, 2025

updated a dataset 3 months ago

Keven16/G-OPD-Training-Data

Viewer • Updated Feb 17 • 134k • 452 • 1

published a dataset 3 months ago

Keven16/G-OPD-Training-Data

Viewer • Updated Feb 17 • 134k • 452 • 1

upvoted a paper 3 months ago

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

Paper • 2602.12125 • Published Feb 12 • 64

submitted a paper to Daily Papers 3 months ago

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

Paper • 2602.12125 • Published Feb 12 • 64

upvoted 2 papers 3 months ago

Fine-T2I: An Open, Large-Scale, and Diverse Dataset for High-Quality T2I Fine-Tuning

Paper • 2602.09439 • Published Feb 10 • 13

AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research

Paper • 2602.06540 • Published Feb 6 • 21

upvoted a paper 4 months ago

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

Paper • 2601.13761 • Published Jan 20 • 16

Wenkai Yang

AI & ML interests

Recent Activity

Organizations

Keven16's activity