graelo
/

Qwen2.5-14B-Instruct-1M-AWQ

4-bit precision

Model card Files Files and versions

Quantized from Qwen/Qwen2.5-14B-Instruct-1M down to 4 bits, GEMM.

Downloads last month: 1,002

Safetensors

Model size

3.33B params

Tensor type

I32

·

BF16

·

F16

·

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for graelo/Qwen2.5-14B-Instruct-1M-AWQ

Base model

Qwen/Qwen2.5-14B

Finetuned

Qwen/Qwen2.5-14B-Instruct-1M

Quantized

(55)

this model