Sparge-attention model zoo

Welcome to Sparge-attention model zoo, this repo contains list of hyperparameters pre-tuned for branch of models.

It was presented in the paper SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference.

Naming of ckpt

The tuned ckpt is often named by following format:${moddel name or type}_${l1}_${pv_l1}.pt, in some cases the pv_l1 will be omitted when not choose to tune pv. The larger l1 and pv_l1 make model more sparse, but may sacrifice output quality.

Overview

model name tuned ckpt dir
CogVideoX-2b cogvideox-2b
want2v-1.3b want2v-1.3B

Per model detail

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support