Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning Paper • 2508.04581 • Published Aug 6 • 4
COSPADI: Compressing LLMs via Calibration-Guided Sparse Dictionary Learning Paper • 2509.22075 • Published 6 days ago • 20
COSPADI: Compressing LLMs via Calibration-Guided Sparse Dictionary Learning Paper • 2509.22075 • Published 6 days ago • 20
Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning Paper • 2508.04581 • Published Aug 6 • 4
COSPADI: Compressing LLMs via Calibration-Guided Sparse Dictionary Learning Paper • 2509.22075 • Published 6 days ago • 20 • 2
ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations Paper • 2505.02819 • Published May 5 • 25 • 4
ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations Paper • 2505.02819 • Published May 5 • 25
ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations Paper • 2505.02819 • Published May 5 • 25