Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features Paper • 2504.00557 • Published Apr 1 • 15
EdgeFusion: On-Device Text-to-Image Generation Paper • 2404.11925 • Published Apr 18, 2024 • 23
LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights Paper • 2404.11936 • Published Apr 18, 2024 • 1
Shortened LLaMA: A Simple Depth Pruning for Large Language Models Paper • 2402.02834 • Published Feb 5, 2024 • 17
A Unified Compression Framework for Efficient Speech-Driven Talking-Face Generation Paper • 2304.00471 • Published Apr 2, 2023 • 1
On Architectural Compression of Text-to-Image Diffusion Models Paper • 2305.15798 • Published May 25, 2023 • 4