speakleash
/

Bielik-11B-v2.2-Instruct-Quanto-8bit

Text Generation

text-generation-inference

8-bit precision

Model card Files Files and versions

Remek commited on Aug 27, 2024

Commit

b70cb3e

·

verified ·

1 Parent(s): fca2a55

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -17,6 +17,8 @@ pipeline_tag: text-generation
 # Bielik-11B-v2.2-Instruct-Quanto-8bit
 This model was converted to Quanto format from [SpeakLeash](https://speakleash.org/)'s [Bielik-11B-v.2.2-Instruct](https://huggingface.co/speakleash/Bielik-11B-v2.2-Instruct).
 ## About Quanto
 Optimum Quanto is a pytorch quantization backend for optimum. Model can be loaded using:

 # Bielik-11B-v2.2-Instruct-Quanto-8bit
 This model was converted to Quanto format from [SpeakLeash](https://speakleash.org/)'s [Bielik-11B-v.2.2-Instruct](https://huggingface.co/speakleash/Bielik-11B-v2.2-Instruct).
+**DISCLAIMER: Be aware that quantised models show reduced response quality and possible hallucinations!**
 ## About Quanto
 Optimum Quanto is a pytorch quantization backend for optimum. Model can be loaded using: