Implicit Reasoning in Transformers is Reasoning through Shortcuts Paper • 2503.07604 • Published Mar 10, 2025 • 22
Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators Paper • 2501.09484 • Published Jan 16, 2025 • 19
AAAR-1.0: Assessing AI's Potential to Assist Research Paper • 2410.22394 • Published Oct 29, 2024 • 16
Revealing the Barriers of Language Agents in Planning Paper • 2410.12409 • Published Oct 16, 2024 • 28
QUERT: Continual Pre-training of Language Model for Query Understanding in Travel Domain Search Paper • 2306.06707 • Published Jun 11, 2023
From Persona to Personalization: A Survey on Role-Playing Language Agents Paper • 2404.18231 • Published Apr 28, 2024 • 1
How Easily do Irrelevant Inputs Skew the Responses of Large Language Models? Paper • 2404.03302 • Published Apr 4, 2024 • 2
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following Paper • 2312.02436 • Published Dec 5, 2023 • 1
TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper • 2402.01622 • Published Feb 2, 2024 • 37
Adaptive Chameleon or Stubborn Sloth: Unraveling the Behavior of Large Language Models in Knowledge Clashes Paper • 2305.13300 • Published May 22, 2023 • 2