Lewdiculous's picture
Update README.md
d75edd3 verified
---
license: apache-2.0
---
GGUF-IQ-Imatrix quants for NLPark/Test1_SLIDE as requested in [#27](https://huggingface.co/Lewdiculous/Model-Requests/discussions/27).
> [!IMPORTANT]
> **Updated!**
> These quants have been redone with the fixes from [llama.cpp/pull/6920](https://github.com/ggerganov/llama.cpp/pull/6920) in mind. <br>
> Use **KoboldCpp version 1.64** or higher.
> [!WARNING]
> Recommended presets [here](https://huggingface.co/Lewdiculous/Model-Requests/tree/main/data/presets/cope-llama-3-0.1) or [here](https://huggingface.co/Virt-io/SillyTavern-Presets). <br>
> Use the latest version of KoboldCpp. **Use the provided presets.** <br>
> This is all still highly experimental, modified configs were used to avoid the tokenizer issues.
"Due to the poor performance of Test0 in Asian Languages, we trained a new preview model."
"This's NLPark's 8B chat model."
"The chat template of our chat models is similar as Llama3."
![SC.jpg](https://cdn-uploads.huggingface.co/production/uploads/64f3e7c7c30c0cf21382eb69/bn9wHKEsFRTieJ8yDTxOK.jpeg)