Optimal Control Meets Flow Matching: A Principled Route to Multi-Subject Fidelity Paper • 2510.02315 • Published 6 days ago • 5
Go with Your Gut: Scaling Confidence for Autoregressive Image Generation Paper • 2509.26376 • Published 8 days ago • 8
DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing Paper • 2510.02253 • Published 6 days ago • 10
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation Paper • 2510.02283 • Published 6 days ago • 77
Code2Video: A Code-centric Paradigm for Educational Video Generation Paper • 2510.01174 • Published 7 days ago • 27
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance Paper • 2509.26231 • Published 8 days ago • 17
DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder Paper • 2509.25182 • Published 9 days ago • 33
Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step Paper • 2509.23924 • Published 10 days ago • 7
Rolling Forcing: Autoregressive Long Video Diffusion in Real Time Paper • 2509.25161 • Published 9 days ago • 21
VideoScore2: Think before You Score in Generative Video Evaluation Paper • 2509.22799 • Published 12 days ago • 22
SparseD: Sparse Attention for Diffusion Language Models Paper • 2509.24014 • Published 10 days ago • 29
EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling Paper • 2509.23909 • Published 10 days ago • 26
SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer Paper • 2509.24695 • Published 9 days ago • 38
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention Paper • 2509.24006 • Published 10 days ago • 110
FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing Paper • 2509.22244 • Published 12 days ago • 5
UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models Paper • 2509.21760 • Published 12 days ago • 12