[CLS] Token Tells Everything Needed for Training-free Efficient MLLMs Paper • 2412.05819 • Published Dec 8, 2024
PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation Paper • 2412.03409 • Published Dec 4, 2024