GGUF versions of the following model: https://huggingface.co/mridul3301/BioMistral-7B-finetuned
Three quantization formats:
- fp8
- fp16
- fp32
Converted from safetensors to GGUF for CPU inference using llama_cpp
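Once a GGUF file from this repo is downloaded locally, it can be loaded on CPU with the `llama-cpp-python` bindings (`llama_cpp`). A minimal sketch; the model filename and generation parameters below are assumptions, not taken from this repo:

```python
from llama_cpp import Llama

# Load a local GGUF file on CPU.
# "biomistral-7b-finetuned.fp16.gguf" is a placeholder filename;
# substitute the actual file downloaded from this repository.
llm = Llama(
    model_path="biomistral-7b-finetuned.fp16.gguf",
    n_ctx=2048,    # context window size
    n_threads=4,   # CPU threads to use for inference
)

# Run a single completion request.
output = llm(
    "Question: What is the function of hemoglobin?\nAnswer:",
    max_tokens=128,
    stop=["\n\n"],
)

print(output["choices"][0]["text"])
```

The fp16 file roughly halves the memory footprint relative to fp32, while fp8-style quantization reduces it further at some cost in output quality.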