Revisiting LRP: Positional Attribution as the Missing Ingredient for Transformer Explainability • Paper • 2506.02138 • Published Jun 2 • 1
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models • Paper • 2301.13826 • Published Jan 31, 2023 • 1
Voice Separation with an Unknown Number of Multiple Speakers • Paper • 2003.01531 • Published Feb 29, 2020 • 2
Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation • Paper • 2309.16429 • Published Sep 28, 2023 • 11
AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation • Paper • 2305.13050 • Published May 22, 2023 • 3