Hanayo: Harnessing Wave-like Pipeline Parallelism for Enhanced Large Model Training Efficiency Paper • 2308.15762 • Published Aug 30, 2023 • 1
HeteGen: Heterogeneous Parallel Inference for Large Language Models on Resource-Constrained Devices Paper • 2403.01164 • Published Mar 2, 2024
DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers Paper • 2403.10266 • Published Mar 15, 2024
SRDiffusion: Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation Paper • 2505.19151 • Published May 25 • 1
SRDiffusion: Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation Paper • 2505.19151 • Published May 25 • 1