Llama 2 7B model quantized to 4-bit with bitsandbytes.
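
A minimal loading sketch, assuming the repository ships a pre-quantized checkpoint whose quantization settings are stored in its config, and that `transformers`, `accelerate`, and `bitsandbytes` are installed on a CUDA machine. The helper name `load_quantized_model` is hypothetical; only the repo ID comes from this page.

```python
MODEL_ID = "corneille97/llama-2-7b-4bits-turbo"  # repo ID taken from this page


def load_quantized_model(model_id: str = MODEL_ID):
    """Load the checkpoint with Hugging Face transformers.

    Assumption: the 4-bit bitsandbytes settings are saved in the repo's
    config, so no explicit quantization config is passed here.
    """
    # Imported lazily so this file can be imported without the heavy dependencies.
    from transformers import AutoModelForCausalLM

    # device_map="auto" relies on accelerate to place the weights on available devices.
    return AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")


# Usage (downloads the weights, needs a GPU):
#   model = load_quantized_model()
#   print(model.get_memory_footprint())  # size of the loaded weights in bytes
```

If the checkpoint turns out not to carry its quantization settings, passing a `BitsAndBytesConfig(load_in_4bit=True)` through the `quantization_config` argument of `from_pretrained` would be the usual alternative.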

Format: Safetensors
Model size: 3.6B params
Tensor types: F32, FP16, U8
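
The reported 3.6B params for a 7B model, together with the U8 tensor type, is most likely a consequence of 4-bit weights being stored two per byte, so each stored byte is counted as one element. The sketch below illustrates that packing idea in plain Python. It is an illustration only, not the bitsandbytes kernel: the function names and the high-nibble-first order are assumptions.

```python
def pack_nibbles(codes):
    """Pack 4-bit codes (integers 0..15) into bytes, two codes per byte."""
    codes = list(codes)
    if any(not 0 <= c <= 15 for c in codes):
        raise ValueError("every code must fit in 4 bits (0..15)")
    if len(codes) % 2:
        codes.append(0)  # pad an odd-length input with a zero code
    # First code of each pair goes in the high nibble, second in the low nibble.
    return bytes((hi << 4) | lo for hi, lo in zip(codes[0::2], codes[1::2]))


def unpack_nibbles(packed, count):
    """Recover the first `count` 4-bit codes from packed bytes."""
    out = []
    for byte in packed:
        out.append(byte >> 4)    # high nibble
        out.append(byte & 0x0F)  # low nibble
    return out[:count]


codes = [1, 15, 0, 7, 9]
packed = pack_nibbles(codes)
restored = unpack_nibbles(packed, len(codes))
```

Here five 4-bit codes occupy three bytes, which is why a byte-level element count comes out at roughly half the number of quantized weights.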

Model tree for corneille97/llama-2-7b-4bits-turbo

Quantizations: 1 model