Update README.md
README.md CHANGED
@@ -21,16 +21,8 @@ This model [mlx-community/Kimi-K2-Instruct-0905-mlx-DQ3_K_M](https://huggingface
 converted to MLX format from [moonshotai/Kimi-K2-Instruct-0905](https://huggingface.co/moonshotai/Kimi-K2-Instruct-0905)
 using mlx-lm version **0.26.3**.
 
----
-
-## Who is this for?
-
 This is created for people using a single Apple Mac Studio M3 Ultra with 512 GB. The 4-bit version of Kimi K2 does not fit. Using research results, we aim to get 4-bit performance from a slightly smaller and smarter quantization. It should also not be so large that it leaves no memory for a useful context window.
 
----
-
-## Use this model with mlx
-
 ```bash
 pip install mlx-lm
 
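
The sizing claim in the paragraph kept by this hunk is easy to sanity-check with back-of-the-envelope arithmetic. A rough sketch follows; the ~1T total parameter count for Kimi K2 and the effective bit widths (which fold in quantization scales) are illustrative assumptions, not numbers from this README:

```python
# Rough weight-memory arithmetic for the "does not fit in 512 GB" claim.
# Assumptions for illustration only: ~1e12 total parameters, and effective
# bits per weight that already include group scales/zero points.
PARAMS = 1.0e12

def weights_gib(bits_per_weight: float) -> float:
    """Approximate weight storage in GiB at a given effective bit width."""
    return PARAMS * bits_per_weight / 8 / 1024**3

print(f"~4.5 bpw (typical 4-bit):       {weights_gib(4.5):.0f} GiB")  # ~524 GiB, over 512 GB
print(f"~3.5 bpw (illustrative 3-bit mix): {weights_gib(3.5):.0f} GiB")  # ~407 GiB, headroom left
```

On this arithmetic, a plain 4-bit quant slightly exceeds the machine's 512 GB, while a mix around 3.5 bpw leaves on the order of 100 GB for the KV cache and runtime, which is the trade-off the paragraph describes.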
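
The hunk is cut off just after `pip install mlx-lm`, so the rest of the usage section is not visible in this diff. For reference, the Python snippet that mlx-community model cards typically continue with looks like the following; this is a sketch using the standard `mlx_lm` API, not text recovered from the diff:

```python
from mlx_lm import load, generate

# Downloads (on first use) and loads the quantized weights from the Hub.
model, tokenizer = load("mlx-community/Kimi-K2-Instruct-0905-mlx-DQ3_K_M")

prompt = "hello"

# Kimi K2 is a chat model, so wrap the prompt in its chat template.
if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True)

response = generate(model, tokenizer, prompt=prompt, verbose=True)
```

With `verbose=True`, `generate` streams tokens to stdout as they are produced, which is convenient for a first smoke test of the converted weights.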