arxiv:2503.21614

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Published on Mar 27

· Submitted by

Xiaoye08 on Mar 31

#3 Paper of the day

Upvote

Authors:

Xiaoye Qu ,

Yafu Li ,

Zhaochen Su ,

Weigao Sun ,

Jianhao Yan ,

Dongrui Liu ,

Ganqu Cui ,

Shuxian Liang ,

Junxian He ,

Bowen Zhou ,

Abstract

Recent Large Reasoning Models (LRMs), such as DeepSeek-R1 and OpenAI o1, have demonstrated strong performance gains by scaling up the length of Chain-of-Thought (CoT) reasoning during inference. However, a growing concern lies in their tendency to produce excessively long reasoning traces, which are often filled with redundant content (e.g., repeated definitions), over-analysis of simple problems, and superficial exploration of multiple reasoning paths for harder tasks. This inefficiency introduces significant challenges for training, inference, and real-world deployment (e.g., in agent-based systems), where token economy is critical. In this survey, we provide a comprehensive overview of recent efforts aimed at improving reasoning efficiency in LRMs, with a particular focus on the unique challenges that arise in this new paradigm. We identify common patterns of inefficiency, examine methods proposed across the LRM lifecycle, i.e., from pretraining to inference, and discuss promising future directions for research. To support ongoing development, we also maintain a real-time GitHub repository tracking recent progress in the field. We hope this survey serves as a foundation for further exploration and inspires innovation in this rapidly evolving area.

View arXiv page View PDF GitHub repository Add to collection

Community

Xiaoye08

Paper author Paper submitter 3 days ago

In this survey, we provide a comprehensive overview of recent efforts aimed at improving reasoning efficiency in LRMs, with a particular focus on the unique challenges that arise in this new paradigm.

We provide a definition of reasoning efficiency, identify and characterize common patterns of reasoning inefficiency, and outline the current challenges that are unique to improving reasoning efficiency in large models.
We provide a comprehensive review of recent advancements aimed at enhancing reasoning
efficiency, structured across the end-to-end LRM development pipeline, from pretraining
and supervised fine-tuning to reinforcement learning and inference.