ChengpengLi

3 18 2

AI & ML interests

LLM for Reasoning, reinforcement learning, recommendation system, diffusion models

Recent Activity

upvoted a paper 2 months ago

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

upvoted a paper 3 months ago

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

upvoted a paper 5 months ago

Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers

View all activity

Organizations

None yet

upvoted a paper 2 months ago

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published Apr 20 • 88

upvoted a paper 3 months ago

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

Paper • 2604.10866 • Published Apr 13 • 69

upvoted a paper 5 months ago

Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers

Paper • 2602.06079 • Published Feb 4 • 21

upvoted a paper 6 months ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 164

upvoted a paper 7 months ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published Dec 15, 2025 • 65

upvoted 2 papers 9 months ago

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16, 2025 • 109

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 119

upvoted 2 papers 11 months ago

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning

Paper • 2508.10433 • Published Aug 14, 2025 • 146

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 161

commented a paper about 1 year ago

CoRT: Code-integrated Reasoning within Thinking

Paper • 2506.09820 • Published Jun 11, 2025 • 18 •

upvoted a paper about 1 year ago

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Paper • 2505.16410 • Published May 22, 2025 • 59

authored a paper over 1 year ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113

commented a paper over 1 year ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113 •

upvoted a paper over 1 year ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113

published a model over 1 year ago

ChengpengLi/START

Updated Feb 21, 2025

upvoted 3 papers over 1 year ago

liked a Space over 1 year ago

Qwen2.5 Math Demo

🧮

244

Answer math questions from uploaded images or sketches

upvoted a collection almost 2 years ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 10 items • Updated Mar 2 • 92

ChengpengLi

AI & ML interests

Recent Activity

Organizations

ChengpengLi's activity

Qwen2.5 Math Demo