ktoprakucar/gte-Qwen2-1.5B-instruct-Q8-GPTQ · Hugging Face

This is the 8-bit quantized version of Alibaba-NLP/gte-Qwen2-1.5B-instruct by following the example from the AutoGPTQ repository.

Downloads last month: 149

Safetensors

Model size

807M params

Tensor type

I32

·

F16

·

Inference Providers NEW

Sentence Similarity

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ktoprakucar/gte-Qwen2-1.5B-instruct-Q8-GPTQ

Base model

Alibaba-NLP/gte-Qwen2-1.5B-instruct

Quantized

(20)

this model