An Analysis of Hyper-Parameter Optimization Methods for Retrieval Augmented Generation Paper • 2505.03452 • Published May 6 • 2
Debatable Intelligence: Benchmarking LLM Judges via Debate Speech Evaluation Paper • 2506.05062 • Published Jun 5 • 15
JuStRank: Benchmarking LLM Judges for System Ranking Paper • 2412.09569 • Published Dec 12, 2024 • 20
The Benefits of Bad Advice: Autocontrastive Decoding across Model Layers Paper • 2305.01628 • Published May 2, 2023
Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI Paper • 2401.14019 • Published Jan 25, 2024 • 24