TokenButler Collection TokenButler -- Predict token importance for all heads across the transformer in the first layer itself. Enable fine-grained token sparsity! • 6 items • Updated 2 days ago • 2