ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering Paper • 2410.05077 • Published Oct 7, 2024 • 3
Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering Paper • 2503.14996 • Published Mar 19 • 1
Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation Paper • 2504.17025 • Published Apr 23 • 17
Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends Paper • 2407.21489 • Published Jul 31, 2024 • 2
LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps Paper • 2412.15035 • Published Dec 19, 2024 • 4
What's the Meaning of Superhuman Performance in Today's NLU? Paper • 2305.08414 • Published May 15, 2023 • 1
Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-OASIS Paper • 2411.19655 • Published Nov 29, 2024 • 20
Word Sense Linking: Disambiguating Outside the Sandbox Paper • 2412.09370 • Published Dec 12, 2024 • 9