- AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction
  Paper • 2504.01014 • Published • 52
- VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
  Paper • 2504.01956 • Published • 32
- DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance
  Paper • 2504.01724 • Published • 57
Collections including paper arxiv:2504.01956
- M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding
  Paper • 2411.04952 • Published • 29
- Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models
  Paper • 2411.05005 • Published • 13
- M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models
  Paper • 2411.04075 • Published • 17
- Self-Consistency Preference Optimization
  Paper • 2411.04109 • Published • 19
- Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
  Paper • 2312.04483 • Published • 7
- AnimateZero: Video Diffusion Models are Zero-Shot Image Animators
  Paper • 2312.03793 • Published • 18
- Photorealistic Video Generation with Diffusion Models
  Paper • 2312.06662 • Published • 24
- PEEKABOO: Interactive Video Generation via Masked-Diffusion
  Paper • 2312.07509 • Published • 12
- GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning
  Paper • 2311.12631 • Published • 15
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
  Paper • 2401.06066 • Published • 54
- VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
  Paper • 2504.01956 • Published • 32