LLM-Drop

university

https://github.com/CASE-Lab-UMD/LLM-Drop

AI & ML interests

Efficient and adaptive foundation models across language and multimodal intelligence.

Recent Activity

shwai-he updated a collection 5 days ago

s1ghhh submitted a paper 5 days ago

Drop-Then-Recovery: How Redundant Are Vision-Language-Action Models?

shwai-he authored a paper 3 months ago

Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Papers

Drop-Then-Recovery: How Redundant Are Vision-Language-Action Models?

Demystifying When Pruning Works via Representation Hierarchies

updated a collection 5 days ago

LLM-Drop

Model weights of paper "Uncovering the Redundancy in Transformers via a Unified Study of Layer Dropping (TMLR)". • 19 items • Updated 5 days ago • 6

submitted a paper to Daily Papers 5 days ago

Drop-Then-Recovery: How Redundant Are Vision-Language-Action Models?

Paper • 2606.27755 • Published 9 days ago • 4

authored 11 papers 3 months ago

Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Paper • 2402.10110 • Published Feb 15, 2024 • 3

What Matters in Transformers? Not All Attention is Needed

Paper • 2406.15786 • Published Jun 22, 2024 • 33

Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts

Paper • 2503.05066 • Published Mar 7, 2025 • 5

SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning

Paper • 2504.10369 • Published Apr 14, 2025 • 2

CogniPair: From LLM Chatbots to Conscious AI Agents -- GNWT-Based Multi-Agent Digital Twins for Social Pairing -- Dating & Hiring Applications

Paper • 2506.03543 • Published Jun 4, 2025 • 1

CoIn: Counting the Invisible Reasoning Tokens in Commercial Opaque LLM APIs

Paper • 2505.13778 • Published May 19, 2025 • 5

Dense Video Understanding with Gated Residual Tokenization

Paper • 2509.14199 • Published Sep 17, 2025 • 3

Understanding and Harnessing Sparsity in Unified Multimodal Models

Paper • 2512.02351 • Published Dec 2, 2025 • 5

Making Large Language Models Efficient Dense Retrievers

Paper • 2512.20612 • Published Dec 23, 2025 • 5

ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model

Paper • 2603.22281 • Published Mar 23 • 20

Demystifying When Pruning Works via Representation Hierarchies

Paper • 2603.24652 • Published Apr 6 • 20

updated a Space 3 months ago

README

updated a collection 3 months ago

LLM-Drop

Model weights of paper "Uncovering the Redundancy in Transformers via a Unified Study of Layer Dropping (TMLR)". • 19 items • Updated 5 days ago • 6

published a Space 3 months ago

README

published a model 3 months ago

LLM-Drop/BAGEL-MoE-7B-GEN-32to16

Text-to-Image • Updated Apr 10 • 6 • 4