Qwen/Qwen2.5-VL-7B-Instruct (Quantized)

Description

This model is a quantized version of the original model Qwen/Qwen2.5-VL-7B-Instruct.

It's quantized using the BitsAndBytes library to 4-bit using the bnb-my-repo space.

Quantization Details

  • Quantization Type: int4
  • bnb_4bit_quant_type: nf4
  • bnb_4bit_use_double_quant: True
  • bnb_4bit_compute_dtype: bfloat16
  • bnb_4bit_quant_storage: uint8
Downloads last month
0
Safetensors
Model size
3.91B params
Tensor type
F32
BF16
U8
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for medmekk/Qwen2.5-VL-7B-Instruct-2

Quantized
(23)
this model