Update README.md
Browse files
README.md
CHANGED
|
@@ -22,6 +22,7 @@ Only the weights and activations of the linear operators within transformers blo
|
|
| 22 |
|
| 23 |
FP8 compuation is supported on Nvidia GPUs with compute capability > 8.9 (Ada Lovelace, Hopper).
|
| 24 |
|
|
|
|
| 25 |
|
| 26 |
## Use with vLLM
|
| 27 |
|
|
|
|
| 22 |
|
| 23 |
FP8 compuation is supported on Nvidia GPUs with compute capability > 8.9 (Ada Lovelace, Hopper).
|
| 24 |
|
| 25 |
+
**DISCLAIMER: Be aware that quantised models show reduced response quality and possible hallucinations!**
|
| 26 |
|
| 27 |
## Use with vLLM
|
| 28 |
|