Update README.md
Browse files
README.md
CHANGED
@@ -29,7 +29,7 @@ library_name: transformers
|
|
29 |
- Context Length: 32,768 natively and [131,072 tokens with YaRN](https://huggingface.co/Qwen/Qwen3-32B-AWQ#processing-long-texts).
|
30 |
- Quantization: AWQ 4-bit
|
31 |
|
32 |
-
For more details, including benchmark evaluation and inference performance, please refer to our [GitHub](https://github.com/Adlik/model_zoo/
|
33 |
|
34 |
## Quantization
|
35 |
|
|
|
29 |
- Context Length: 32,768 natively and [131,072 tokens with YaRN](https://huggingface.co/Qwen/Qwen3-32B-AWQ#processing-long-texts).
|
30 |
- Quantization: AWQ 4-bit
|
31 |
|
32 |
+
For more details, including benchmark evaluation and inference performance, please refer to our [GitHub](https://github.com/Adlik/model_zoo/blob/qwen3_quant/LLM/models/Qwen3-32B_quantization_tutorial.md).
|
33 |
|
34 |
## Quantization
|
35 |
|