LawFlow: Collecting and Simulating Lawyers' Thought Processes Paper • 2504.18942 • Published 7 days ago • 4
Learning Explainable Dense Reward Shapes via Bayesian Optimization Paper • 2504.16272 • Published 11 days ago • 5
Benchmarking Cognitive Biases in Large Language Models as Evaluators Paper • 2309.17012 • Published Sep 29, 2023 • 3
Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models Paper • 2504.20157 • Published 5 days ago • 32
CoEdIT: Text Editing by Task-Specific Instruction Tuning Paper • 2305.09857 • Published May 17, 2023 • 7