Triangle104/Homunculus-Q8_0-GGUF

reasoning-transfer

Triangle104 committed 8 days ago

Commit 2e8d44d (verified) · 1 parent: d6ff7a1

Update README.md

Files changed (1)

README.md +4 -0

@@ -18,6 +18,10 @@ tags:
 This model was converted to GGUF format from [`arcee-ai/Homunculus`](https://huggingface.co/arcee-ai/Homunculus) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/arcee-ai/Homunculus) for more details on the model.
+---
+Homunculus is a 12 billion-parameter instruction model distilled from Qwen3-235B onto the Mistral-Nemo backbone. It was purpose-built to preserve Qwen’s two-mode interaction style—/think (deliberate chain-of-thought) and /nothink (concise answers)—while running on a single consumer GPU.
+---
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)