UQFF
Collection
UQFF models. Examples for each in the model card!
β’
37 items
β’
Updated
β’
18
mistralai/Mistral-Nemo-Instruct-2407, UQFF quantization
Run with mistral.rs. Documentation: UQFF docs.
| Quantization type(s) | Example |
|---|---|
| FP8 | ./mistralrs-server -i plain -m EricB/Mistral-Nemo-Instruct-2407-UQFF --from-uqff mistral-nemo-2407-instruct-f8e4m3.uqff |
| HQQ4 | ./mistralrs-server -i plain -m EricB/Mistral-Nemo-Instruct-2407-UQFF --from-uqff mistral-nemo-2407-instruct-hqq4.uqff |
| HQQ8 | ./mistralrs-server -i plain -m EricB/Mistral-Nemo-Instruct-2407-UQFF --from-uqff mistral-nemo-2407-instruct-hqq8.uqff |
| Q3K | ./mistralrs-server -i plain -m EricB/Mistral-Nemo-Instruct-2407-UQFF --from-uqff mistral-nemo-2407-instruct-q3k.uqff |
| Q4K | ./mistralrs-server -i plain -m EricB/Mistral-Nemo-Instruct-2407-UQFF --from-uqff mistral-nemo-2407-instruct-q4k.uqff |
| Q5K | ./mistralrs-server -i plain -m EricB/Mistral-Nemo-Instruct-2407-UQFF --from-uqff mistral-nemo-2407-instruct-q5k.uqff |
| Q8_0 | ./mistralrs-server -i plain -m EricB/Mistral-Nemo-Instruct-2407-UQFF --from-uqff mistral-nemo-2407-instruct-q8_0.uqff |
Base model
mistralai/Mistral-Nemo-Base-2407