Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization Paper • 2402.03161 • Published Feb 5, 2024 • 15
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization Paper • 2309.04669 • Published Sep 9, 2023 • 2
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance Paper • 2405.14677 • Published May 23, 2024 • 12
Harder Tasks Need More Experts: Dynamic Routing in MoE Models Paper • 2403.07652 • Published Mar 12, 2024
Pyramidal Flow Matching for Efficient Video Generative Modeling Paper • 2410.05954 • Published Oct 8, 2024 • 41