A quick quantization of Qwen2.5-VL-7B-Instruct made with llm-compressor. I used the example script with MAX_SEQUENCE set to 32768, NUM_CALIBRATION_SAMPLES set to 512, and truncation disabled (with truncation enabled in the tokenizer I hit a bug in vLLM).
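
For anyone reproducing the run, the settings above could be captured roughly as follows. This is only a sketch: `build_tokenizer_kwargs` and `CALIBRATION_KWARGS` are hypothetical names, not part of llm-compressor, and `padding=False` is an assumption. The only grounded parts are the values from this card and the fact that Hugging Face tokenizers and processors accept `padding`, `truncation`, and `max_length` keyword arguments.

```python
# Calibration settings reported in this card.
MAX_SEQUENCE = 32768
NUM_CALIBRATION_SAMPLES = 512
TRUNCATION = False  # disabled for this run


def build_tokenizer_kwargs(max_sequence: int, truncation: bool) -> dict:
    """Hypothetical helper: kwargs for a Hugging Face tokenizer/processor call."""
    # padding=False is an assumption, not something stated in the card.
    kwargs = {"padding": False, "truncation": truncation}
    if truncation:
        # With padding off, max_length only takes effect when truncation is on.
        kwargs["max_length"] = max_sequence
    return kwargs


CALIBRATION_KWARGS = build_tokenizer_kwargs(MAX_SEQUENCE, TRUNCATION)
print(CALIBRATION_KWARGS)
```

The resulting dictionary could then be unpacked into the tokenizer or processor call in the calibration preprocessing step. With truncation off, samples longer than MAX_SEQUENCE are passed through at full length rather than being cut.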

Safetensors model size: 2.63B params. Tensor types: I64, I32, BF16.