Llama-3.2-1B-chatml-tool-v4 GGUF Quantized Models

Technical Details

  • Quantization Tool: llama.cpp
  • Version: version: 5092 (d3bd7193)

Model Information

Available Files

💡 Q4_K_M provides the best balance for most use cases

Downloads last month
115
GGUF
Model size
1.24B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for matrixportal/Llama-3.2-1B-chatml-tool-v4-GGUF

Datasets used to train matrixportal/Llama-3.2-1B-chatml-tool-v4-GGUF