SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation Paper • 2503.09641 • Published 17 days ago • 29
SANA-Sprint Collection SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation • 5 items • Updated 2 days ago • 7
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization Paper • 2412.17739 • Published Dec 23, 2024 • 41
FastVLM: Efficient Vision Encoding for Vision Language Models Paper • 2412.13303 • Published Dec 17, 2024 • 13
FashionComposer: Compositional Fashion Image Generation Paper • 2412.14168 • Published Dec 18, 2024 • 16
FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on Paper • 2411.10499 • Published Nov 15, 2024 • 13
RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published Nov 19, 2024 • 55
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality Paper • 2410.19355 • Published Oct 25, 2024 • 23
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published Nov 15, 2024 • 122
MarDini: Masked Autoregressive Diffusion for Video Generation at Scale Paper • 2410.20280 • Published Oct 26, 2024 • 23
What Matters in Transformers? Not All Attention is Needed Paper • 2406.15786 • Published Jun 22, 2024 • 31
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices Paper • 2410.11795 • Published Oct 15, 2024 • 18
Animate-X: Universal Character Image Animation with Enhanced Motion Representation Paper • 2410.10306 • Published Oct 14, 2024 • 56
Progressive Autoregressive Video Diffusion Models Paper • 2410.08151 • Published Oct 10, 2024 • 16
Pyramidal Flow Matching for Efficient Video Generative Modeling Paper • 2410.05954 • Published Oct 8, 2024 • 39