SparseD: Sparse Attention for Diffusion Language Models Paper • 2509.24014 • Published 11 days ago • 29
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention Paper • 2509.24006 • Published 11 days ago • 110