arxiv:2506.05309

Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games

Published on Jun 5

· Submitted by

niveck on Jun 12

Upvote

Authors:

Niv Eckhaus ,

Abstract

An adaptive asynchronous LLM-agent performs similarly to human players in online Mafia games, demonstrating the potential for integrating LLMs into realistic group settings with complex social dynamics.

AI-generated summary

LLMs are used predominantly in synchronous communication, where a human user and a model communicate in alternating turns. In contrast, many real-world settings are inherently asynchronous. For example, in group chats, online team meetings, or social games, there is no inherent notion of turns; therefore, the decision of when to speak forms a crucial part of the participant's decision making. In this work, we develop an adaptive asynchronous LLM-agent which, in addition to determining what to say, also decides when to say it. To evaluate our agent, we collect a unique dataset of online Mafia games, including both human participants, as well as our asynchronous agent. Overall, our agent performs on par with human players, both in game performance, as well as in its ability to blend in with the other human players. Our analysis shows that the agent's behavior in deciding when to speak closely mirrors human patterns, although differences emerge in message content. We release all our data and code to support and encourage further research for more realistic asynchronous communication between LLM agents. This work paves the way for integration of LLMs into realistic human group settings, from assistance in team discussions to educational and professional environments where complex social dynamics must be navigated.

View arXiv page View PDF Project page GitHub repository Add to collection

Community

niveck

Paper author Paper submitter 1 day ago

LLM agents that don't just decide what to say, but also when to say it in group conversations.

The Problem: Most AI is designed for turn-taking (you ask, it answers). But real communication is asynchronous - like in face to face interaction, and also in written group interaction like chats: timing matters as much as content.

Our Solution: We built an agent with two components:
⏰🤖 Scheduler: Decides WHETHER to post a message right now
✍️🤖 Generator: Composes the actual message content

We tested it in games of Mafia (a.k.a/similar to Werewolf/Resistance/Among Us/Traitors/...) alongside human players! We publish all of the data as a new dataset: LLMafia (available on HuggingFace 🤗).

Results:
✅ Agent matches human timing patterns
✅ Similar win rates to humans
✅ Humans fail to identify the agent >40% of the time
❌ Agent messages are distinguishable by classifiers

This opens doors for AI in team collaboration, group learning, and any setting where natural conversation flow matters!