README.md · huggingkot/Gemmasutra-Small-4B-v1-bnb-4bit at main

metadata

base_model:
  - TheDrummer/Gemmasutra-Small-4B-v1

About this Conversion

This conversion uses Unsloth to load the model in 4-bit format and force-save it in the same 4-bit format.

The actual 4-bit quantization is handled by BitsAndBytes (bnb), which works under Torch via AutoGPTQ or BitsAndBytes.
Unsloth acts as a wrapper, simplifying and optimizing the process for better efficiency.

This allows for reduced memory usage and faster inference while keeping the model compact.