# Phi-3 Mini 4K Instruct (ONNX, Raspberry Pi)
Efficient, quantized ONNX export of Phi-3 Mini 4K Instruct, optimized for local inference on resource-constrained devices like the Raspberry Pi.
## 📦 Model Overview
- Archive: `microsoft_Phi-3-mini-4k-instruct_onnx_rpi.tar.gz`
- Format: Quantized ONNX
- Recommended for: ONNX Runtime, Arm KleidiAI, and similar frameworks
Unpack the archive to access the ONNX model and configuration files.
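For example, on the device:

```bash
tar -xzf microsoft_Phi-3-mini-4k-instruct_onnx_rpi.tar.gz
```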
## 🚀 Quick Start
Install ONNX Runtime if you haven't already:

```bash
pip install onnxruntime
```
Run a first sanity check with the model. Creating the session loads the quantized graph; a full generation loop additionally needs a tokenizer and decoding logic (see the sketch below):

```python
import onnxruntime as ort

# Load the quantized model; on a Raspberry Pi this runs on the default CPU provider
session = ort.InferenceSession("phi-3-mini-4k-instruct.onnx")

# Inspect the export's expected inputs before wiring up a decoding loop
print([inp.name for inp in session.get_inputs()])
```
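As a rough sketch of a full generation loop, here is naive greedy decoding without a KV cache. The input and output names (`input_ids`, `attention_mask`, `logits`) and the use of the base model's Hugging Face tokenizer are assumptions, not guarantees about this export; many decoder exports also require `position_ids` and `past_key_values.*` inputs, so check `session.get_inputs()` from the snippet above first:

```python
import numpy as np
import onnxruntime as ort
from transformers import AutoTokenizer  # tokenizer from the base model repo (assumption)

tokenizer = AutoTokenizer.from_pretrained("microsoft/Phi-3-mini-4k-instruct")
session = ort.InferenceSession("phi-3-mini-4k-instruct.onnx")

# Phi-3 chat format: a user turn followed by an assistant turn
prompt = "<|user|>\nWhat is a Raspberry Pi?<|end|>\n<|assistant|>\n"
ids = tokenizer(prompt, return_tensors="np")["input_ids"].astype(np.int64)

for _ in range(64):  # naive greedy loop, no KV cache: simple but slow
    feeds = {"input_ids": ids, "attention_mask": np.ones_like(ids)}
    (logits,) = session.run(["logits"], feeds)
    next_id = int(logits[0, -1].argmax())  # most likely next token
    if next_id == tokenizer.eos_token_id:
        break
    ids = np.concatenate([ids, np.array([[next_id]], dtype=np.int64)], axis=1)

print(tokenizer.decode(ids[0], skip_special_tokens=True))
```

Re-running the whole sequence each step is fine for short outputs on a Pi, but a cache-aware loop (or a higher-level runtime) is much faster for long generations.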
## 🛠️ Features
- Optimized for Raspberry Pi: Low memory footprint and fast inference
- Quantized: Reduced model size for edge deployment (see the sketch after this list)
- Open Source: MIT licensed
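For reference, a minimal sketch of how a dynamically quantized INT8 model like this can be produced with ONNX Runtime's `quantize_dynamic`. The file names are hypothetical, and the actual export pipeline here (Optimum + ONNX Runtime) may have used different settings:

```python
from onnxruntime.quantization import QuantType, quantize_dynamic

# Dynamic INT8 quantization of an FP32 ONNX export (hypothetical file names)
quantize_dynamic(
    model_input="phi-3-mini-4k-instruct-fp32.onnx",
    model_output="phi-3-mini-4k-instruct.onnx",
    weight_type=QuantType.QInt8,  # 8-bit weights; activations quantized at runtime
)
```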
## 🙏 Credits
- Base model: Microsoft
- ONNX export & quantization: Optimum, ONNX Runtime
- Maintainer: Makatia