datatab
/

Yugo45-GPT-Quantized-GGUF

text-generation-inference

Model card Files Files and versions Community

Yugo45-GPT-Quantized-GGUF

Ctrl+K

Ctrl+K

1 contributor

History: 17 commits

datatab's picture

q5_k_m: Recommended. Uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q5_K

71fc4da verified over 1 year ago

.gitattributes

1.94 kB

q5_k_m: Recommended. Uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q5_K over 1 year ago
README.md

3.7 kB

Update README.md over 1 year ago
Yugo45-GPT-Quantized-GGUF.Q3_K_M.gguf

3.52 GB
LFS

q3_k_m: Uses Q4_K for the attention.wv, attention.wo, and feed_forward.w2 tensors, else Q3_K over 1 year ago
Yugo45-GPT-Quantized-GGUF.Q4_K_M.gguf

4.37 GB
LFS

q4_k_m: Recommended. Uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q4_K over 1 year ago
Yugo45-GPT-Quantized-GGUF.Q5_K_M.gguf

5.13 GB
LFS

q5_k_m: Recommended. Uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q5_K over 1 year ago
config.json

31 Bytes

Create config.json over 1 year ago