
7.Boyut-c4ai GGUF Quantized Models

Technical Details

  • Quantization Tool: llama.cpp
  • Version: 5318 (15e03282)

Model Information

Available Files

🚀 Download | 🔢 Type | 📝 Description
Download | Q4_K_M | 4-bit balanced (recommended default)

💡 Q4_K_M offers a good balance between file size and output quality for most use cases
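
To make the size trade-off concrete, here is a rough back-of-the-envelope sketch in Python. It estimates weight storage as parameters × bits per weight / 8, using the 8.03B parameter count listed on this card. The 4.85 bits-per-weight figure for Q4_K_M is an assumption (a commonly quoted approximation that varies by architecture and tensor mix), and the function name `estimate_gguf_size_bytes` is hypothetical. The actual file may be noticeably larger, since some tensors are stored at higher precision and the file also carries metadata.

```python
def estimate_gguf_size_bytes(n_params: float, bits_per_weight: float) -> float:
    """Rough weight-only size estimate: parameters * bits per weight / 8."""
    if n_params <= 0 or bits_per_weight <= 0:
        raise ValueError("n_params and bits_per_weight must be positive")
    return n_params * bits_per_weight / 8


N_PARAMS = 8.03e9    # parameter count listed on this model card
Q4_K_M_BPW = 4.85    # ASSUMPTION: approximate average bits per weight for Q4_K_M
FP16_BPW = 16.0      # unquantized half-precision baseline, for comparison

q4_bytes = estimate_gguf_size_bytes(N_PARAMS, Q4_K_M_BPW)
fp16_bytes = estimate_gguf_size_bytes(N_PARAMS, FP16_BPW)

print(f"Q4_K_M (assumed {Q4_K_M_BPW} bpw): ~{q4_bytes / 1e9:.1f} GB")
print(f"FP16 baseline: ~{fp16_bytes / 1e9:.1f} GB")
print(f"Approximate reduction: ~{fp16_bytes / q4_bytes:.1f}x")
```

Treat the result as an order-of-magnitude guide for disk and memory planning, not as the exact size of the downloadable file.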

  • Format: GGUF
  • Model size: 8.03B params
  • Architecture: cohere2