Llama-3.2-1B-chatml-tool-v4 GGUF Quantized Models
Technical Details
- Quantization Tool: llama.cpp
- Version: 5092 (d3bd7193)
Model Information
- Base Model: minpeter/Llama-3.2-1B-chatml-tool-v4
- Quantized by: matrixportal
Available Files
- llama-3.2-1b-chatml-tool-v4.q8_0.gguf (1259.88 MB)
💡 Q4_K_M usually offers the best balance of size and quality for most use cases, but only the Q8_0 file is listed here.
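As a rough sanity check on the listed file size, the sketch below estimates how large a Q8_0 file should be. It relies on the Q8_0 block layout used by llama.cpp (32 int8 weights plus one 16-bit scale per block, i.e. 8.5 bits per weight). The parameter count is an assumption taken from the standard Llama 3.2 1B architecture, not from this card, and the names `ASSUMED_PARAM_COUNT` and `estimate_q8_0_size_mib` are hypothetical. The estimate ignores GGUF metadata, the tokenizer, and any tensors kept at higher precision, so the real file is expected to be slightly larger.

```python
# Q8_0 block layout in llama.cpp: 32 int8 weights plus one fp16 scale per block.
Q8_0_BLOCK_WEIGHTS = 32
Q8_0_BLOCK_BYTES = 32 + 2  # 32 one-byte weights + a 2-byte scale

# ASSUMPTION: parameter count of the standard Llama 3.2 1B architecture.
# This card does not state it, and a fine-tune may differ slightly.
ASSUMED_PARAM_COUNT = 1_235_814_400


def q8_0_bits_per_weight() -> float:
    """Effective storage cost of one weight under Q8_0, in bits."""
    return Q8_0_BLOCK_BYTES * 8 / Q8_0_BLOCK_WEIGHTS


def estimate_q8_0_size_mib(param_count: int) -> float:
    """Estimate the tensor payload in MiB if every weight were stored as Q8_0."""
    n_blocks = param_count / Q8_0_BLOCK_WEIGHTS
    return n_blocks * Q8_0_BLOCK_BYTES / (1024 ** 2)


if __name__ == "__main__":
    print(f"bits per weight: {q8_0_bits_per_weight()}")
    print(f"estimated size:  {estimate_q8_0_size_mib(ASSUMED_PARAM_COUNT):.2f} MiB")
```

Under these assumptions the estimate lands a few MiB below the listed 1259.88 MB, which is consistent with the file also carrying metadata and a handful of unquantized tensors, and suggests the listed figure is in binary megabytes.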