Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published Feb 19 • 70
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM Paper • 2503.04724 • Published Mar 6 • 69
Audio-Aware Large Language Models as Judges for Speaking Styles Paper • 2506.05984 • Published 5 days ago • 14