Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs Paper • 2412.21187 • Published Dec 30, 2024 • 37
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search Paper • 2410.03864 • Published Oct 4, 2024 • 11
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning Paper • 2407.00617 • Published Jun 30, 2024 • 7
Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 28, 2024 • 98
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning Paper • 2407.00617 • Published Jun 30, 2024 • 7
Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 28, 2024 • 98
DREAM: A Challenge Dataset and Models for Dialogue-Based Reading Comprehension Paper • 1902.00164 • Published Feb 1, 2019
Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension Paper • 1904.09679 • Published Apr 21, 2019
CLUE: A Chinese Language Understanding Evaluation Benchmark Paper • 2004.05986 • Published Apr 13, 2020
Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models Paper • 2308.00304 • Published Aug 1, 2023 • 23
Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations Paper • 2311.04335 • Published Nov 7, 2023 • 1
Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models Paper • 2401.08350 • Published Jan 16, 2024
MinT: Boosting Generalization in Mathematical Reasoning via Multi-View Fine-Tuning Paper • 2307.07951 • Published Jul 16, 2023
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning Paper • 2406.12050 • Published Jun 17, 2024 • 19
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning Paper • 2406.12050 • Published Jun 17, 2024 • 19