Papers
arxiv:2506.18701

Matrix-Game: Interactive World Foundation Model

Published on Jun 23
ยท Submitted by Vanint on Jun 25
#3 Paper of the day
Authors:
,
,
,
,
,
,
,

Abstract

Matrix-Game, a controllable game world generation model trained in a two-stage process, outperforms existing models by producing high-quality, action-controllable, and physically consistent Minecraft world videos.

AI-generated summary

We introduce Matrix-Game, an interactive world foundation model for controllable game world generation. Matrix-Game is trained using a two-stage pipeline that first performs large-scale unlabeled pretraining for environment understanding, followed by action-labeled training for interactive video generation. To support this, we curate Matrix-Game-MC, a comprehensive Minecraft dataset comprising over 2,700 hours of unlabeled gameplay video clips and over 1,000 hours of high-quality labeled clips with fine-grained keyboard and mouse action annotations. Our model adopts a controllable image-to-world generation paradigm, conditioned on a reference image, motion context, and user actions. With over 17 billion parameters, Matrix-Game enables precise control over character actions and camera movements, while maintaining high visual quality and temporal coherence. To evaluate performance, we develop GameWorld Score, a unified benchmark measuring visual quality, temporal quality, action controllability, and physical rule understanding for Minecraft world generation. Extensive experiments show that Matrix-Game consistently outperforms prior open-source Minecraft world models (including Oasis and MineWorld) across all metrics, with particularly strong gains in controllability and physical consistency. Double-blind human evaluations further confirm the superiority of Matrix-Game, highlighting its ability to generate perceptually realistic and precisely controllable videos across diverse game scenarios. To facilitate future research on interactive image-to-world generation, we will open-source the Matrix-Game model weights and the GameWorld Score benchmark at https://github.com/SkyworkAI/Matrix-Game.

Community

Paper author Paper submitter

Matrix-Game โ€” a 17B+ parameter interactive world foundation model ๐Ÿš€. All model weights, inference code, and benchmarks are open-sourced. You're welcome to explore, use and discuss! ๐Ÿ˜†

๐Ÿ”— Homepage: https://matrix-game-homepage.github.io
๐Ÿ’ป GitHub: https://github.com/SkyworkAI/Matrix-Game

Outstanding! Would be cool to integrate some open interoperabiltiy protocols too like OMI (Open Metaverse Interoperability) extensions. There is one called omi_personality too which might be good for v2. Cheers!

Sign up or log in to comment

Models citing this paper 1

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2506.18701 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2506.18701 in a Space README.md to link it from this page.

Collections including this paper 1