Improve Mathematical Reasoning in Language Models by Automated Process Supervision Paper • 2406.06592 • Published Jun 5, 2024 • 30
Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset Paper • 1909.05855 • Published Sep 12, 2019
Template Guided Text Generation for Task-Oriented Dialogue Paper • 2004.15006 • Published Apr 30, 2020
UIBert: Learning Generic Multimodal Representations for UI Understanding Paper • 2107.13731 • Published Jul 29, 2021
SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems Paper • 2110.06800 • Published Oct 13, 2021
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models Paper • 2206.04615 • Published Jun 9, 2022 • 5
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback Paper • 2309.00267 • Published Sep 1, 2023 • 50
Show, Don't Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue Paper • 2204.04327 • Published Apr 8, 2022
PERL: Parameter Efficient Reinforcement Learning from Human Feedback Paper • 2403.10704 • Published Mar 15, 2024 • 60