Update README.md
README.md
CHANGED
@@ -18,6 +18,10 @@ tags:
 This model was converted to GGUF format from [`arcee-ai/Homunculus`](https://huggingface.co/arcee-ai/Homunculus) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/arcee-ai/Homunculus) for more details on the model.
 
+---
+Homunculus is a 12-billion-parameter instruction model distilled from Qwen3-235B onto the Mistral-Nemo backbone. It was purpose-built to preserve Qwen’s two-mode interaction style, /think (deliberate chain-of-thought) and /nothink (concise answers), while running on a single consumer GPU.
+
+---
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)
 