Curated MLX-ready quantized LLMs that run fast on Apple Silicon (and some on iOS). Every card lists Bits · Group size · Peak UM (GB) · Stable context.
-
mlx-community/Apriel-1.5-15b-Thinker-3bit-MLX
Image-Text-to-Text • Updated • 42 -
mlx-community/Apriel-1.5-15b-Thinker-6bit-MLX
Image-Text-to-Text • Updated • 93 -
mlx-community/granite-4.0-h-tiny-3bit-MLX
Text Generation • 0.9B • Updated • 122 • 2 -
mlx-community/granite-4.0-tiny-preview-4bit
Text Generation • 1B • Updated • 163