matrixportal commited on
Commit
f49e208
·
verified ·
1 Parent(s): 0dbd203

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +35 -0
README.md ADDED
@@ -0,0 +1,35 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - ytu-ce-cosmos/Turkish-Llama-8b-Instruct-v0.1
4
+ - meta-llama/Llama-3.1-8B
5
+ - meta-llama/Llama-3.1-8B-Instruct
6
+ tags:
7
+ - merge
8
+ - mergekit
9
+ - lazymergekit
10
+ - ytu-ce-cosmos/Turkish-Llama-8b-Instruct-v0.1
11
+ - meta-llama/Llama-3.1-8B
12
+ - meta-llama/Llama-3.1-8B-Instruct
13
+ ---
14
+
15
+ # Hibrid-Llama-Linear GGUF Quantized Models
16
+
17
+ ## Technical Details
18
+ - **Quantization Tool:** llama.cpp
19
+ - **Version:** version: 5126 (307bfa25)
20
+
21
+ ## Model Information
22
+ - **Base Model:** [matrixportal/Hibrid-Llama-Linear](https://huggingface.co/matrixportal/Hibrid-Llama-Linear)
23
+ - **Quantized by:** [matrixportal](https://huggingface.co/matrixportal)
24
+
25
+ ## Available Files
26
+ | 🚀 Download | 🔢 Type | 📝 Description |
27
+ |------------|---------|---------------|
28
+ | [Download](https://huggingface.co/matrixportal/Hibrid-Llama-Linear-GGUF/resolve/main/hibrid-llama-linear.q3_k_m.gguf) | Q3 K M | Small, acceptable quality |
29
+ | [Download](https://huggingface.co/matrixportal/Hibrid-Llama-Linear-GGUF/resolve/main/hibrid-llama-linear.q4_0.gguf) | Q4 0 | Standard 4-bit (fast on ARM) |
30
+ | [Download](https://huggingface.co/matrixportal/Hibrid-Llama-Linear-GGUF/resolve/main/hibrid-llama-linear.q4_k_m.gguf) | Q4 K M | 4-bit balanced (recommended default) |
31
+ | [Download](https://huggingface.co/matrixportal/Hibrid-Llama-Linear-GGUF/resolve/main/hibrid-llama-linear.q5_k_m.gguf) | Q5 K M | 5-bit best (recommended HQ option) |
32
+ | [Download](https://huggingface.co/matrixportal/Hibrid-Llama-Linear-GGUF/resolve/main/hibrid-llama-linear.q6_k.gguf) | Q6 K | 6-bit near-perfect (premium quality) |
33
+ | [Download](https://huggingface.co/matrixportal/Hibrid-Llama-Linear-GGUF/resolve/main/hibrid-llama-linear.q8_0.gguf) | Q8 0 | 8-bit maximum (overkill for most) |
34
+
35
+ 💡 **Q4 K M** provides the best balance for most use cases