graelo
/

Qwen2.5-7B-Instruct-1M-AWQ

4-bit precision

Model card Files Files and versions

Quantized from Qwen/Qwen2.5-7B-Instruct-1M down to 4 bits, GEMM

Downloads last month: 43

Safetensors

Model size

1.96B params

Tensor type

I32

·

BF16

·

F16

·

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for graelo/Qwen2.5-7B-Instruct-1M-AWQ

Base model

Qwen/Qwen2.5-7B

Finetuned

Qwen/Qwen2.5-7B-Instruct-1M

Quantized

(71)

this model