Upload folder using huggingface_hub
.gitattributes
CHANGED
@@ -33,3 +33,7 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+DeepHermes-3-Mistral-24B-Preview-q4.gguf filter=lfs diff=lfs merge=lfs -text
+DeepHermes-3-Mistral-24B-Preview-q5.gguf filter=lfs diff=lfs merge=lfs -text
+DeepHermes-3-Mistral-24B-Preview-q6.gguf filter=lfs diff=lfs merge=lfs -text
+DeepHermes-3-Mistral-24B-Preview-q8.gguf filter=lfs diff=lfs merge=lfs -text
DeepHermes-3-Mistral-24B-Preview-q4.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d53c1bbeedf66712de8faf5f153dcb9a19a6ccb03cde6dacd8eadb602e720238
+size 13441842528
DeepHermes-3-Mistral-24B-Preview-q5.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f1810503fd7348b1c88b870639d217aad45c22db37dcad5d498b5b90bf4a0819
+size 16304458848
DeepHermes-3-Mistral-24B-Preview-q6.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9f0d295014808940b144a4455e0b9db1ec805722481fe11d227112f913e5c137
+size 19345988704
DeepHermes-3-Mistral-24B-Preview-q8.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:87e48145a902e5c5eb3fce42535ee3166f27ef993cc29f61ae390101cdd72e44
+size 25054844512
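
Each of the four pointer files above records only the SHA-256 digest and byte size of the real GGUF payload, so a finished download can be checked against them. The following is a minimal sketch of that check, not part of the commit; the names `LfsPointer`, `parse_pointer`, and `matches_pointer` are hypothetical helpers introduced here for illustration.

```python
import hashlib
import os
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class LfsPointer:
    oid: str   # hex SHA-256 digest of the real file
    size: int  # size of the real file in bytes


def parse_pointer(text: str) -> LfsPointer:
    """Parse the three-line Git LFS pointer format shown in the hunks above."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    if fields.get("version") != "https://git-lfs.github.com/spec/v1":
        raise ValueError("not a Git LFS v1 pointer")
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256" or not re.fullmatch(r"[0-9a-f]{64}", digest):
        raise ValueError("unexpected oid format")
    return LfsPointer(oid=digest, size=int(fields["size"]))


def matches_pointer(path: str, pointer: LfsPointer, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the pointer's size and digest."""
    if os.path.getsize(path) != pointer.size:
        return False  # cheap check first; avoids hashing a truncated download
    h = hashlib.sha256()
    with open(path, "rb") as f:
        # Hash in chunks so multi-gigabyte files never sit in memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest() == pointer.oid


# Pointer contents copied from the q4 hunk above.
Q4_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:d53c1bbeedf66712de8faf5f153dcb9a19a6ccb03cde6dacd8eadb602e720238
size 13441842528
"""

pointer = parse_pointer(Q4_POINTER)
print(pointer.size, pointer.oid)
```

Calling `matches_pointer("DeepHermes-3-Mistral-24B-Preview-q4.gguf", pointer)` would then confirm a local copy, assuming the file sits in the working directory. The size comparison runs first because it is instant, while hashing a file of about 12.5 GiB takes noticeably longer.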
README.md
ADDED
@@ -0,0 +1,24 @@
+# GGUF Quantized Models
+
+This repository contains GGUF quantized versions of language models for efficient inference.
+
+## Files
+
+The following GGUF files are available in this repository:
+
+- `DeepHermes-3-Mistral-24B-Preview-q4.gguf` (13G)
+- `DeepHermes-3-Mistral-24B-Preview-q5.gguf` (16G)
+- `DeepHermes-3-Mistral-24B-Preview-q6.gguf` (19G)
+- `DeepHermes-3-Mistral-24B-Preview-q8.gguf` (24G)
+
+## Usage
+
+These models are compatible with [llama.cpp](https://github.com/ggerganov/llama.cpp) and similar frameworks.
+
+Example usage with llama.cpp:
+```bash
+./main -m /path/to/model.gguf -p "Hello, I am a language model" -n 128
+```
+
+## Upload Information
+Files were uploaded on Tue Mar 11 03:48:02 PDT 2025
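
The README's example invocation can also be assembled programmatically, which is convenient when switching between the four quantizations. This is a rough sketch rather than anything the commit ships: `build_llama_command` is a hypothetical helper, and the default binary name `./main` is simply taken from the README. Depending on the llama.cpp version, the CLI binary may instead be named `llama-cli`, in which case pass that as `binary`.

```python
import shlex
from typing import List


def build_llama_command(
    model_path: str,
    prompt: str,
    n_predict: int = 128,
    binary: str = "./main",  # assumption: same binary name as in the README
) -> List[str]:
    """Build the argv list for the README's example invocation."""
    return [binary, "-m", model_path, "-p", prompt, "-n", str(n_predict)]


cmd = build_llama_command(
    # Assumes the file has been downloaded into the working directory.
    "DeepHermes-3-Mistral-24B-Preview-q4.gguf",
    "Hello, I am a language model",
)

# Print a shell-quoted form for inspection.
# To actually run it, pass the list to subprocess.run(cmd, check=True).
print(shlex.join(cmd))
```

Passing the arguments as a list instead of one string avoids shell-quoting problems when the prompt contains spaces or quotes.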