add files

Files changed (10)

.gitattributes CHANGED

@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

+*.json filter=lfs diff=lfs merge=lfs -text

README.md ADDED

+---
+base_model:
+- Sao10K/L3-8B-Stheno-v3.2
+---
+These are the weights of the [L3-8B-Stheno-v3.2](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2) model, converted to the [Unsloth 4-bit dynamic quant](https://archive.is/EFz7P) format using this [Colab notebook](https://colab.research.google.com/drive/1P23C66j3ga49kBRnDNlmRce7R_l_-L5l?usp=sharing).
+## About this Conversion
+This conversion uses **Unsloth** to load the model in **4-bit** format and force-save it in the same **4-bit** format.
+### How 4-bit Quantization Works
+- The actual **4-bit quantization** is handled by **BitsAndBytes (bnb)**, which runs on top of **PyTorch**.
+- **Unsloth** acts as a wrapper, simplifying and optimizing the process for better efficiency.
+This reduces memory usage and keeps the checkpoint compact; inference can also be faster, depending on hardware.
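
To make the README's description of 4-bit quantization concrete, here is a minimal pure-Python sketch of blockwise absmax quantization. It is an illustration under stated assumptions, not the BitsAndBytes implementation: bnb uses a non-uniform NF4 codebook and GPU kernels, while this sketch uses uniform signed levels. The function names are hypothetical.

```python
# Simplified illustration only: uniform 4-bit levels with one absmax scale
# per block. BitsAndBytes itself uses an NF4 codebook and CUDA kernels.
from typing import List, Tuple


def quantize_block(values: List[float]) -> Tuple[float, List[int]]:
    """Map floats to signed 4-bit codes in [-7, 7] using one scale per block."""
    absmax = max(abs(v) for v in values) or 1.0  # avoid dividing by zero
    codes = [round(v / absmax * 7) for v in values]
    return absmax, codes


def dequantize_block(absmax: float, codes: List[int]) -> List[float]:
    """Recover approximate floats from the codes and the block scale."""
    return [c / 7 * absmax for c in codes]


def pack_nibbles(codes: List[int]) -> bytes:
    """Store two 4-bit codes per byte (offset by 8 so each code is non-negative)."""
    padded = codes + [0] * (len(codes) % 2)
    return bytes(((a + 8) << 4) | (b + 8) for a, b in zip(padded[0::2], padded[1::2]))


weights = [0.5, -1.0, 0.25, 0.0, 0.9, -0.3]
scale, codes = quantize_block(weights)
restored = dequantize_block(scale, codes)
packed = pack_nibbles(codes)
print(len(packed), "bytes for", len(weights), "weights")
```

Each weight costs half a byte plus a share of the per-block scale, which is where the memory saving comes from; the rounding error per weight is at most half a quantization step.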

config.json ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:20c17be80716f05b8f9ff110d0cce59c038348bc0756cd4cf5a068a9b85d5cfb
+size 1180

generation_config.json ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:b39ca5e2fe39ef609e19dd0a74736bb2d23cabede6b48601a14b21ec3dafd9cd
+size 169

model-00001-of-00002.safetensors ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:1b2ffc22be10d759a937b179ea164fe9eea60f5c74dad460c8aff6e5e771c4cf
+size 4652072855

model-00002-of-00002.safetensors ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:f963ac3fe05ea53b795ce4e56a5e003bb5a03466da8aaf9bcf69347a59de1ae9
+size 1050673280

model.safetensors.index.json ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:491dbd938157d23c5e24670dc15cb0df5da822d543d48f46e4f4814b9a3fd70d
+size 132271

special_tokens_map.json ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:6cc5fa1d5ef35562b5f838a7301eec05b5c1fa04681d725a0ebfb92bc318cd59
+size 345

tokenizer.json ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:3c5cf44023714fb39b05e71e425f8d7b92805ff73f7988b083b8c87f0bf87393
+size 17209961

tokenizer_config.json ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:2f72bef96a9bc260dc254c391f05025789bfe7eab29fcec468115fc5fe8a56ed
+size 51055
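
Every added file other than README.md appears in this diff as a Git LFS pointer (three `key value` lines: `version`, `oid`, `size`) rather than as its real contents. The sketch below shows one way such a pointer could be parsed; `parse_lfs_pointer` is a hypothetical helper, and the sample text is the `config.json` pointer from this commit with the diff's leading `+` removed.

```python
from typing import Dict, Union


def parse_lfs_pointer(text: str) -> Dict[str, Union[str, int]]:
    """Parse the space-separated key/value lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file, in bytes
    }


pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:20c17be80716f05b8f9ff110d0cce59c038348bc0756cd4cf5a068a9b85d5cfb
size 1180
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algo"], info["size"])
```

The `size` field is the byte count of the actual file held in LFS storage, so the two safetensors shards can be sized from their pointers without downloading them.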