MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation Paper • 2303.00628 • Published Mar 1, 2023
Beyond Random Sampling: Efficient Language Model Pretraining via Curriculum Learning Paper • 2506.11300 • Published Jun 12 • 1
Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts Paper • 2507.04569 • Published 29 days ago • 19
Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts Paper • 2507.04569 • Published 29 days ago • 19
Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts Paper • 2507.04569 • Published 29 days ago • 19
Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts Paper • 2507.04569 • Published 29 days ago • 19
Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts Paper • 2507.04569 • Published 29 days ago • 19
Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts Paper • 2507.04569 • Published 29 days ago • 19
Lost in the Mix: Evaluating LLM Understanding of Code-Switched Text Paper • 2506.14012 • Published Jun 16 • 10
Lost in the Mix: Evaluating LLM Understanding of Code-Switched Text Paper • 2506.14012 • Published Jun 16 • 10
Medical Dead-ends and Learning to Identify High-risk States and Treatments Paper • 2110.04186 • Published Oct 8, 2021
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published Jun 17 • 49
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published Jun 17 • 49
Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples Paper • 2502.09650 • Published Feb 11
LLM as a Broken Telephone: Iterative Generation Distorts Information Paper • 2502.20258 • Published Feb 27 • 27
LLM as a Broken Telephone: Iterative Generation Distorts Information Paper • 2502.20258 • Published Feb 27 • 27