lokinfey
/

Phi-3.5-moe-mlx-int4

Model card Files Files and versions Community

lokinfey commited on Aug 29, 2024

Commit

a44526a

·

verified ·

1 Parent(s): b119aa7

init README.md

Files changed (1) hide show

README.md +98 -3

README.md CHANGED Viewed

@@ -1,3 +1,98 @@
----
-license: mit
----

+---
+license: mit
+---
+This is a quantized INT4 model based on Apple MLX Framework Phi-3.5-MoE-Instruct. You can deploy it on Apple Silicon devices.
+Installation
+```bash
+pip install -U mlx-lm
+```
+Conversion
+```bash
+python -m mlx_lm.convert --hf-path microsoft/Phi-3.5-MoE-instruct  -q
+```
+Samples
+```python
+from mlx_lm import load, generate
+model, tokenizer = load("./phi-3.5-moe-mlx-int4")
+sys_msg = """You are a helpful AI assistant, you are an agent capable of using a variety of tools to answer a question. Here are a few of the tools available to you:
+- Blog: This tool helps you describe a certain knowledge point and content, and finally write it into Twitter or Facebook style content
+- Translate: This is a tool that helps you translate into any language, using plain language as required
+To use these tools you must always respond in JSON format containing `"tool_name"` and `"input"` key-value pairs. For example, to answer the question, "Build Muliti Agents with MOE models" you must use the calculator tool like so:
+```json
+{
+    "tool_name": "Blog",
+    "input": "Build Muliti Agents with MOE models"
+}
+```
+Or to translate the question "can you introduce yourself in Chinese" you must respond:
+```json
+{
+    "tool_name": "Search",
+    "input": "can you introduce yourself in Chinese"
+}
+```
+Remember just output the final result, ouput in JSON format containing `"agentid"`,`"tool_name"` , `"input"` and `"output"`  key-value pairs .:
+```json
+[
+{   "agentid": "step1",
+    "tool_name": "Blog",
+    "input": "Build Muliti Agents with MOE models",
+    "output": "........."
+},
+{   "agentid": "step2",
+    "tool_name": "Search",
+    "input": "can you introduce yourself in Chinese",
+    "output": "........."
+},
+{
+    "agentid": "final"
+    "tool_name": "Result",
+    "output": "........."
+}
+]
+```
+The users answer is as follows.
+"""
+query ='Write something about Generative AI with MOE , translate it to Chinese'
+prompt = tokenizer.apply_chat_template(
+    [{"role": "system", "content": sys_msg},{"role": "user", "content": query}],
+    tokenize=False,
+    add_generation_prompt=True,
+)
+response = generate(model, tokenizer, prompt=prompt,max_tokens=1024, verbose=True)
+```