Qwen/Qwen2.5-VL-7B-Instruct (Quantized)
Description
This model is a quantized version of the original model Qwen/Qwen2.5-VL-7B-Instruct
.
It's quantized using the BitsAndBytes library to 4-bit using the bnb-my-repo space.
Quantization Details
- Quantization Type: int4
- bnb_4bit_quant_type: nf4
- bnb_4bit_use_double_quant: True
- bnb_4bit_compute_dtype: bfloat16
- bnb_4bit_quant_storage: uint8
- Downloads last month
- 0
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no library tag.
Model tree for medmekk/Qwen2.5-VL-7B-Instruct-2
Base model
Qwen/Qwen2.5-VL-7B-Instruct