view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 3 days ago • 72
view article Article MotionLCM-V2: Improved Compression Rate for Multi-Latent-Token Diffusion By wxDai • Dec 11, 2024 • 14
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation Paper • 2412.07589 • Published Dec 10, 2024 • 45