matrixportal
/

Llama-3.2-1B-chatml-tool-v4-GGUF

Text Generation

Model card Files Files and versions

Llama-3.2-1B-chatml-tool-v4 GGUF Quantized Models

Technical Details

Quantization Tool: llama.cpp
Version: version: 5092 (d3bd7193)

Model Information

Base Model: minpeter/Llama-3.2-1B-chatml-tool-v4
Quantized by: matrixportal

Available Files

llama-3.2-1b-chatml-tool-v4.q8_0.gguf (1259.88MB)

💡 Q4_K_M provides the best balance for most use cases

Downloads last month: 115

GGUF

Model size

1.24B params

Architecture

llama

Hardware compatibility

Log In to view the estimation

8-bit

16-bit

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for matrixportal/Llama-3.2-1B-chatml-tool-v4-GGUF

meta-llama/Llama-3.2-1B

minpeter/QLoRA-Llama-3.2-1B-chatml-tool-v4

Merge model

this model

Datasets used to train matrixportal/Llama-3.2-1B-chatml-tool-v4-GGUF