Dian Yu

yudian

https://scholar.google.com/citations?user=ERdzqyYAAAAJ&hl=en

AI & ML interests

NLP

Recent Activity

upvoted a paper 10 days ago

Complex Logical Instruction Generation

authored a paper about 1 month ago

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

authored a paper about 1 month ago

One Token to Fool LLM-as-a-Judge

Organizations

None yet

authored 2 papers about 1 month ago

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published Apr 15 • 13

One Token to Fool LLM-as-a-Judge

Paper • 2507.08794 • Published Jul 11 • 31

authored 4 papers 5 months ago

Expanding RL with Verifiable Rewards Across Diverse Domains

Paper • 2503.23829 • Published Mar 31 • 24

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 42

OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas

Paper • 2501.15427 • Published Jan 26 • 6

Improving LLM General Preference Alignment via Optimistic Online Mirror Descent

Paper • 2502.16852 • Published Feb 24

authored a paper 7 months ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 61

authored 12 papers about 1 year ago

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

Paper • 2407.00617 • Published Jun 30, 2024 • 7

LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published Jun 29, 2024 • 40

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 105

DREAM: A Challenge Dataset and Models for Dialogue-Based Reading Comprehension

Paper • 1902.00164 • Published Feb 1, 2019

Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension

Paper • 1904.09679 • Published Apr 21, 2019

Dialogue-Based Relation Extraction

Paper • 2004.08056 • Published Apr 17, 2020

CLUE: A Chinese Language Understanding Evaluation Benchmark

Paper • 2004.05986 • Published Apr 13, 2020

Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models

Paper • 2308.00304 • Published Aug 1, 2023 • 23

Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations

Paper • 2311.04335 • Published Nov 7, 2023 • 1

Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models

Paper • 2401.08350 • Published Jan 16, 2024

MinT: Boosting Generalization in Mathematical Reasoning via Multi-View Fine-Tuning

Paper • 2307.07951 • Published Jul 16, 2023

Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning

Paper • 2406.12050 • Published Jun 17, 2024 • 19

authored a paper over 1 year ago

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18, 2024 • 56