Seedance 1.0: Exploring the Boundaries of Video Generation Models • Paper • 2506.09113 • Published 4 days ago • 62
Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces • Paper • 2506.00123 • Published 15 days ago • 33
What Matters in Training a GPT4-Style Language Model with Multimodal Inputs? • Paper • 2307.02469 • Published Jul 5, 2023 • 12
Boximator: Generating Rich and Controllable Motions for Video Synthesis • Paper • 2402.01566 • Published Feb 2, 2024 • 28
CodePlan: Repository-level Coding using LLMs and Planning • Paper • 2309.12499 • Published Sep 21, 2023 • 78