LLM-Drop
Collection
Model weights for the paper "Uncovering the Redundancy in Transformers via a Unified Study of Layer Dropping" (TMLR). • 19 items
Efficient and adaptive foundation models across language and multimodal intelligence.
Drop-Then-Recovery: How Redundant Are Vision-Language-Action Models?
Demystifying When Pruning Works via Representation Hierarchies