datatab/Yugo45-GPT-Quantized-GGUF

Transformers
GGUF
Serbian
mistral
text-generation-inference
  • 1 contributor
History: 17 commits
Latest commit by datatab (71fc4da, verified, about 1 year ago):
q5_k_m: Recommended. Uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q5_K
  • .gitattributes
    1.94 kB
    q5_k_m: Recommended. Uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q5_K (about 1 year ago)
  • README.md
    3.7 kB
    Update README.md (about 1 year ago)
  • Yugo45-GPT-Quantized-GGUF.Q3_K_M.gguf
    3.52 GB
    LFS
    q3_k_m: Uses Q4_K for the attention.wv, attention.wo, and feed_forward.w2 tensors, else Q3_K (about 1 year ago)
  • Yugo45-GPT-Quantized-GGUF.Q4_K_M.gguf
    4.37 GB
    LFS
    q4_k_m: Recommended. Uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q4_K (about 1 year ago)
  • Yugo45-GPT-Quantized-GGUF.Q5_K_M.gguf
    5.13 GB
    LFS
    q5_k_m: Recommended. Uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q5_K (about 1 year ago)
  • config.json
    31 Bytes
    Create config.json (about 1 year ago)
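
The three GGUF files above trade size for precision, so a reader usually has to pick one that fits the memory available. The following is a minimal sketch of that choice, not part of the repository: the file names and sizes are copied from the listing, while `pick_quant`, `download`, and the 1 GB `headroom_gb` default are hypothetical and only illustrative (real memory needs also depend on context length and runtime). The `download` helper assumes the `huggingface_hub` package is installed and uses its `hf_hub_download` function.

```python
REPO_ID = "datatab/Yugo45-GPT-Quantized-GGUF"

# (file name, size in GB) as shown in the file listing.
QUANT_FILES = [
    ("Yugo45-GPT-Quantized-GGUF.Q3_K_M.gguf", 3.52),
    ("Yugo45-GPT-Quantized-GGUF.Q4_K_M.gguf", 4.37),
    ("Yugo45-GPT-Quantized-GGUF.Q5_K_M.gguf", 5.13),
]


def pick_quant(budget_gb, headroom_gb=1.0):
    """Return the largest listed file that fits in the budget, or None.

    headroom_gb is an assumed allowance for runtime overhead; it is a
    guess for illustration, not a measured value.
    """
    usable = budget_gb - headroom_gb
    fitting = [(size, name) for name, size in QUANT_FILES if size <= usable]
    if not fitting:
        return None
    # max() compares by size first, so the largest fitting file wins.
    return max(fitting)[1]


def download(filename):
    """Fetch one file from the repository into the local Hugging Face cache."""
    # Imported lazily so pick_quant works without the package installed.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(repo_id=REPO_ID, filename=filename)
```

With an 8 GB budget this sketch selects the Q5_K_M file, with 6 GB it falls back to Q4_K_M, and below 4.52 GB nothing fits under the assumed headroom. Passing the selected name to `download` returns the local path of the cached file.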