Xihui Liu

XihuiLiu

·

https://xh-liu.github.io/

AI & ML interests

None yet

Organizations

upvoted 2 papers 3 months ago

PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World

Paper • 2605.05163 • Published May 6 • 37

MultiWorld: Scalable Multi-Agent Multi-View Video World Models

Paper • 2604.18564 • Published Apr 20 • 47

upvoted a paper 4 months ago

MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data

Paper • 2603.25319 • Published Mar 26 • 32

upvoted a paper 7 months ago

Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation

Paper • 2512.08186 • Published Dec 9, 2025 • 23

upvoted a paper 8 months ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published Dec 9, 2025 • 134

upvoted 2 papers 9 months ago

Speculative Jacobi-Denoising Decoding for Accelerating Autoregressive Text-to-image Generation

Paper • 2510.08994 • Published Oct 10, 2025 • 4

CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images

Paper • 2510.11718 • Published Oct 13, 2025 • 14

upvoted 2 papers about 1 year ago

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Paper • 2507.06165 • Published Jul 8, 2025 • 60

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning

Paper • 2505.17022 • Published May 22, 2025 • 27

upvoted 11 papers over 1 year ago

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

Paper • 2504.08736 • Published Apr 11, 2025 • 46

HoloPart: Generative 3D Part Amodal Segmentation

Paper • 2504.07943 • Published Apr 10, 2025 • 28

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published Mar 31, 2025 • 38

Position: Interactive Generative Video as Next-Generation Game Engine

Paper • 2503.17359 • Published Mar 21, 2025 • 61

AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset

Paper • 2503.19462 • Published Mar 25, 2025 • 10

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation

Paper • 2503.16430 • Published Mar 20, 2025 • 34

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Paper • 2503.10639 • Published Mar 13, 2025 • 53

GameFactory: Creating New Games with Generative Interactive Videos

Paper • 2501.08325 • Published Jan 14, 2025 • 68

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published Dec 19, 2024 • 52

GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration

Paper • 2412.04440 • Published Dec 5, 2024 • 22

Moto: Latent Motion Token as the Bridging Language for Robot Manipulation

Paper • 2412.04445 • Published Dec 5, 2024 • 22