quantization / build.toml

Commit History

Fix undefined symbol on CUDA 11.8
3313895

danieldk HF Staff committed on

Set CUDA minimum versions for hopper/blackwell cutlass kernels
3370704

danieldk HF Staff committed on

Temporarily disable ROCm
d5fd23d

danieldk HF Staff committed on

Sync to vLLM 20250627
8aa00a3

danieldk HF Staff committed on

Sync capabilities with upstream
59b2fef

danieldk HF Staff committed on

Add support for ROCm
3c8bb73

danieldk HF Staff committed on

Update for build.toml changes
116e562

danieldk HF Staff committed on

Sync with vLLM
0da5bf5

danieldk HF Staff committed on

Add full Marlin support and tests for Marlin/CUTLASS
165b25c

danieldk HF Staff committed on

Add GPTQ-Marlin
c31b5ce

danieldk HF Staff committed on

Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`
5c6fb68

danieldk HF Staff committed on

Add cutlass_w8a8
b4cad21

danieldk HF Staff committed on