mlx-community
/

granite-4.0-h-tiny-4bit

Text Generation

granitemoehybrid

4-bit precision

Model card Files Files and versions

adrgrondin commited on 8 days ago

Commit

02c8783

·

verified ·

1 Parent(s): 980061c

Update README.md

Files changed (1) hide show

README.md +29 -1

README.md CHANGED Viewed

@@ -5,6 +5,34 @@ tags:
 - language
 - granite-4.0
 - mlx
 pipeline_tag: text-generation
-base_model: ibm-granite/granite-4.0-h-tiny
 ---

 - language
 - granite-4.0
 - mlx
+base_model: ibm-granite/granite-4.0-h-micro
 pipeline_tag: text-generation
 ---
+# mlx-community/granite-4.0-h-tiny-4bit
+This model [mlx-community/granite-4.0-h-tiny-4bit](https://huggingface.co/mlx-community/granite-4.0-h-tiny-4bit) was
+converted to MLX format from [ibm-granite/granite-4.0-h-tiny](https://huggingface.co/ibm-granite/granite-4.0-h-tiny)
+using mlx-lm version **0.28.2**.
+## Use with mlx
+```bash
+pip install mlx-lm
+```
+```python
+from mlx_lm import load, generate
+model, tokenizer = load("mlx-community/granite-4.0-h-tiny-4bit")
+prompt = "hello"
+if tokenizer.chat_template is not None:
+    messages = [{"role": "user", "content": prompt}]
+    prompt = tokenizer.apply_chat_template(
+        messages, add_generation_prompt=True
+    )
+response = generate(model, tokenizer, prompt=prompt, verbose=True)
+```