https://huggingface.co/mistralai/Mistral-Small-24B-Base-2501

Q6_K_XL: Q6_K weights, F16 output tensor, F16 token embeddings

Fits a 24K-token context on a 24 GiB GPU
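As a rough sanity check on the context claim above, the sketch below estimates memory for the weights plus an F16 KV cache. Only the 23.6B parameter count comes from this card. The layer count, KV head count, head dimension, vocabulary size, and hidden size are assumptions about the base model and should be checked against its `config.json`. "24K" is taken to mean 24576 tokens, and the KV cache is assumed to be unquantized F16. The figure of 6.5625 bits per weight follows from the Q6_K block layout (210 bytes per 256 weights). Compute buffers and runtime overhead are not included, so the result is a lower bound, not a measurement.

```python
GIB = 1024 ** 3


def kv_cache_bytes(n_ctx, n_layers, n_kv_heads, head_dim, bytes_per_elem=2):
    """Bytes for the K and V caches holding n_ctx tokens."""
    # One K and one V vector per layer, per KV head, per token.
    return 2 * n_layers * n_kv_heads * head_dim * n_ctx * bytes_per_elem


def weight_bytes(n_params, n_f16_params, quant_bits=6.5625):
    """Bytes for the weights: F16 tensors at 2 bytes, the rest at quant_bits."""
    quantized = (n_params - n_f16_params) * quant_bits / 8
    return quantized + n_f16_params * 2


if __name__ == "__main__":
    n_params = 23.6e9          # from the model card
    # Everything below is ASSUMED, not taken from this card.
    n_layers = 40
    n_kv_heads = 8
    head_dim = 128
    vocab_size = 131072
    hidden_size = 5120
    n_ctx = 24 * 1024

    # Token embeddings and output tensor are kept in F16 in this quant.
    n_f16 = 2 * vocab_size * hidden_size

    weights = weight_bytes(n_params, n_f16)
    kv = kv_cache_bytes(n_ctx, n_layers, n_kv_heads, head_dim)
    print(f"weights : {weights / GIB:.2f} GiB")
    print(f"KV cache: {kv / GIB:.2f} GiB")
    print(f"total   : {(weights + kv) / GIB:.2f} GiB (excludes compute buffers)")
```

If the assumed architecture values are wrong, or the KV cache is quantized at runtime, the totals shift accordingly.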

Format: GGUF
Model size: 23.6B params
Architecture: llama