Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games Paper • 2506.05309 • Published 9 days ago • 11
Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation Paper • 2506.08570 • Published 4 days ago • 27
Debatable Intelligence: Benchmarking LLM Judges via Debate Speech Evaluation Paper • 2506.05062 • Published 9 days ago • 14 • 2