teknium committed on
Commit 2f286e5 · verified · 1 parent: 7879052

Upload folder using huggingface_hub

.gitattributes CHANGED
@@ -33,3 +33,7 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ DeepHermes-3-Mistral-24B-Preview-q4.gguf filter=lfs diff=lfs merge=lfs -text
37
+ DeepHermes-3-Mistral-24B-Preview-q5.gguf filter=lfs diff=lfs merge=lfs -text
38
+ DeepHermes-3-Mistral-24B-Preview-q6.gguf filter=lfs diff=lfs merge=lfs -text
39
+ DeepHermes-3-Mistral-24B-Preview-q8.gguf filter=lfs diff=lfs merge=lfs -text
DeepHermes-3-Mistral-24B-Preview-q4.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d53c1bbeedf66712de8faf5f153dcb9a19a6ccb03cde6dacd8eadb602e720238
3
+ size 13441842528
DeepHermes-3-Mistral-24B-Preview-q5.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f1810503fd7348b1c88b870639d217aad45c22db37dcad5d498b5b90bf4a0819
3
+ size 16304458848
DeepHermes-3-Mistral-24B-Preview-q6.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9f0d295014808940b144a4455e0b9db1ec805722481fe11d227112f913e5c137
3
+ size 19345988704
DeepHermes-3-Mistral-24B-Preview-q8.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:87e48145a902e5c5eb3fce42535ee3166f27ef993cc29f61ae390101cdd72e44
3
+ size 25054844512
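Each pointer file above records only a spec version, a SHA-256 `oid`, and a byte `size`; the actual GGUF payload lives in LFS storage. As a minimal sketch (not part of this commit), a downloaded file could be checked against its pointer roughly as follows. The function names `parse_lfs_pointer` and `verify_against_pointer` are hypothetical helpers introduced here for illustration, and the pointer text is copied from the q4 entry.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def verify_against_pointer(path: Path, pointer_text: str, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` matches the pointer's size and SHA-256 oid."""
    fields = parse_lfs_pointer(pointer_text)
    algo, _, expected_digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")
    # Cheap check first: a size mismatch lets us skip hashing a multi-GB file.
    if path.stat().st_size != int(fields["size"]):
        return False
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        # Hash in chunks so the whole file never has to fit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_digest


# Pointer contents copied from the q4 entry in this commit.
Q4_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:d53c1bbeedf66712de8faf5f153dcb9a19a6ccb03cde6dacd8eadb602e720238
size 13441842528
"""
```

Calling `verify_against_pointer(Path("DeepHermes-3-Mistral-24B-Preview-q4.gguf"), Q4_POINTER)` on a local copy would then confirm the download is complete and uncorrupted; the local path is an assumption about where the file was saved.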
README.md ADDED
@@ -0,0 +1,24 @@
1
+ # GGUF Quantized Models
2
+
3
+ This repository contains GGUF quantized versions of DeepHermes-3-Mistral-24B-Preview (q4, q5, q6, and q8) for efficient inference.
4
+
5
+ ## Files
6
+
7
+ The following GGUF files are available in this repository:
8
+
9
+ - `DeepHermes-3-Mistral-24B-Preview-q4.gguf` (13G)
10
+ - `DeepHermes-3-Mistral-24B-Preview-q5.gguf` (16G)
11
+ - `DeepHermes-3-Mistral-24B-Preview-q6.gguf` (19G)
12
+ - `DeepHermes-3-Mistral-24B-Preview-q8.gguf` (24G)
13
+
14
+ ## Usage
15
+
16
+ These models are compatible with [llama.cpp](https://github.com/ggerganov/llama.cpp) and other runtimes that load GGUF files.
17
+
18
+ Example usage with llama.cpp:
19
+ ```bash
20
+ ./llama-cli -m /path/to/model.gguf -p "Hello, I am a language model" -n 128
21
+ ```
22
+
23
+ ## Upload Information
24
+ Files were uploaded on Tue Mar 11 03:48:02 PDT 2025
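
The sizes listed in the README's Files section can be cross-checked against the `size` fields of the LFS pointers in this commit. The sketch below assumes the labels were produced by rounding up to whole binary gigabytes (GiB), as tools like `ls -lh` typically do; that rounding rule is an assumption, not something the commit states. The helper name `rounded_up_gib` is hypothetical.

```python
import math

# Byte counts copied from the `size` fields of the LFS pointers in this commit.
SIZES_BYTES = {
    "DeepHermes-3-Mistral-24B-Preview-q4.gguf": 13441842528,
    "DeepHermes-3-Mistral-24B-Preview-q5.gguf": 16304458848,
    "DeepHermes-3-Mistral-24B-Preview-q6.gguf": 19345988704,
    "DeepHermes-3-Mistral-24B-Preview-q8.gguf": 25054844512,
}

GIB = 1024 ** 3  # one binary gigabyte in bytes


def rounded_up_gib(num_bytes: int) -> int:
    """Round a byte count up to the next whole GiB (assumed labeling rule)."""
    return math.ceil(num_bytes / GIB)


labels = {name: f"{rounded_up_gib(size)}G" for name, size in SIZES_BYTES.items()}

for name, label in labels.items():
    print(f"{name}: {label}")
```

Under that assumption the computed labels agree with the README: the q8 file, for example, is about 23.3 GiB (roughly 25.05 GB in decimal units), which rounds up to the listed `24G`.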