An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion Paper • 2408.03178 • Published Aug 6, 2024 • 41
VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers Paper • 2408.17131 • Published Aug 30, 2024 • 11
LatentSync: Audio Conditioned Latent Diffusion Models for Lip Sync Paper • 2412.09262 • Published Dec 12, 2024 • 1
SegDT: A Diffusion Transformer-Based Segmentation Model for Medical Imaging Paper • 2507.15595 • Published 26 days ago • 4