Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions Paper • 2502.18435 • Published Feb 25 • 1
Omni-Router: Sharing Routing Decisions in Sparse Mixture-of-Experts for Speech Recognition Paper • 2507.05724 • Published Jul 8 • 1
Theory, Analysis, and Best Practices for Sigmoid Self-Attention Paper • 2409.04431 • Published Sep 6, 2024 • 2
Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition Paper • 2405.15216 • Published May 24, 2024 • 16