Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena Paper • 2310.05746 • Published Oct 9, 2023 • 1
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization Paper • 2310.10134 • Published Oct 16, 2023 • 1
Like hiking? You probably enjoy nature: Persona-grounded Dialog with Commonsense Expansions Paper • 2010.03205 • Published Oct 7, 2020
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics Paper • 2102.01672 • Published Feb 2, 2021
Unsupervised Enrichment of Persona-grounded Dialog with Background Stories Paper • 2106.08364 • Published Jun 15, 2021
Large Language Models as Zero-Shot Conversational Recommenders Paper • 2308.10053 • Published Aug 19, 2023
Tell, Don't Show!: Language Guidance Eases Transfer Across Domains in Images and Videos Paper • 2403.05535 • Published Mar 8, 2024 • 1
DiscoveryBench: Towards Data-Driven Discovery with Large Language Models Paper • 2407.01725 • Published Jul 1, 2024