Chujie Zheng

chujiezheng

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

authored a paper 24 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

upvoted a paper 24 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

authored a paper about 1 month ago

BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs

upvoted a paper 24 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 25 days ago • 162

upvoted 4 papers about 1 month ago

upvoted a collection about 2 months ago

Qwen3

Collection

72 items • Updated 12 days ago • 804

upvoted 3 papers 4 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 193

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 104

upvoted 3 papers 5 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 66

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 99

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 76

upvoted a paper 6 months ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 53

upvoted a collection 6 months ago

QVQ

Collection

QVQ: Qwen models for visual reasoning • 7 items • Updated Apr 28 • 50

upvoted a paper 6 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 368

upvoted 3 papers 7 months ago

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published Dec 6, 2024 • 51

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 84

Yi-Lightning Technical Report

Paper • 2412.01253 • Published Dec 2, 2024 • 29

upvoted a collection 7 months ago

QwQ

Collection

Qwen with Questions • 6 items • Updated Apr 28 • 97

upvoted an article 8 months ago

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

Article • Oct 24, 2024 • 12
