huggingkot
commited on
Commit
·
06b7fcc
1
Parent(s):
b9ed71e
add files
Browse files- .gitattributes +1 -0
- README.md +17 -0
- config.json +3 -0
- generation_config.json +3 -0
- model.safetensors +3 -0
- special_tokens_map.json +3 -0
- tokenizer.json +3 -0
- tokenizer.model +3 -0
- tokenizer_config.json +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
*.json filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
@@ -0,0 +1,17 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
|
2 |
+
---
|
3 |
+
base_model:
|
4 |
+
- TheDrummer/Gemmasutra-Small-4B-v1
|
5 |
+
---
|
6 |
+
|
7 |
+
This is a converted weight from [Gemmasutra-Small-4B-v1](https://huggingface.co/TheDrummer/Gemmasutra-Small-4B-v1) model in [unsloth 4-bit dynamic quant](https://archive.is/EFz7P) using this [collab notebook](https://colab.research.google.com/drive/1P23C66j3ga49kBRnDNlmRce7R_l_-L5l?usp=sharing).
|
8 |
+
|
9 |
+
## About this Conversion
|
10 |
+
|
11 |
+
This conversion uses **Unsloth** to load the model in **4-bit** format and force-save it in the same **4-bit** format.
|
12 |
+
|
13 |
+
### How 4-bit Quantization Works
|
14 |
+
- The actual **4-bit quantization** is handled by **BitsAndBytes (bnb)**, which works under **Torch** via **AutoGPTQ** or **BitsAndBytes**.
|
15 |
+
- **Unsloth** acts as a wrapper, simplifying and optimizing the process for better efficiency.
|
16 |
+
|
17 |
+
This allows for reduced memory usage and faster inference while keeping the model compact.
|
config.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d8861836cb634dda1abc43dbfb8b6a8e5ae41595c6c63a5a5c7f35430b437793
|
3 |
+
size 1318
|
generation_config.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d5a04929dc6198b832d2ea9264a0a66165bc9a474faf50a6eddf46fcaab378cd
|
3 |
+
size 230
|
model.safetensors
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:175b1b3f74ce1a11b9ea46ab55678671d531cc290a9b2f92a697fc4fb716b202
|
3 |
+
size 2867911260
|
special_tokens_map.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:baec30ea10906f16adb8c18af7a34023002c1746542612b8b41c9f09e1351351
|
3 |
+
size 636
|
tokenizer.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5f7eee611703c5ce5d1eee32d9cdcfe465647b8aff0c1dfb3bed7ad7dbb05060
|
3 |
+
size 34362873
|
tokenizer.model
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:61a7b147390c64585d6c3543dd6fc636906c9af3865a5548f27f31aee1d4c8e2
|
3 |
+
size 4241003
|
tokenizer_config.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:89d63bf32272ecb82d2bbcf8f90bbf1ebc2b7aa4fedfc1dfee099541628726fe
|
3 |
+
size 47025
|