CoverBench: A Challenging Benchmark for Complex Claim Verification Paper • 2408.03325 • Published Aug 6, 2024 • 15
Causes and Cures for Interference in Multilingual Translation Paper • 2212.07530 • Published Dec 14, 2022
What Do You Get When You Cross Beam Search with Nucleus Sampling? Paper • 2107.09729 • Published Jul 20, 2021
Instruction Induction: From Few Examples to Natural Language Task Descriptions Paper • 2205.10782 • Published May 22, 2022
Multilingual Instruction Tuning With Just a Pinch of Multilinguality Paper • 2401.01854 • Published Jan 3, 2024 • 11
ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding Paper • 2305.14196 • Published May 23, 2023
Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in Language Paper • 2103.01242 • Published Mar 1, 2021
Efficient Long-Text Understanding with Short-Text Models Paper • 2208.00748 • Published Aug 1, 2022 • 1
SCROLLS: Standardized CompaRison Over Long Language Sequences Paper • 2201.03533 • Published Jan 10, 2022 • 1