mlx-community/Sky-T1-32B-Preview-8bit

The Model mlx-community/Sky-T1-32B-Preview-8bit was converted to MLX format from NovaSky-AI/Sky-T1-32B-Preview using mlx-lm version 0.21.0 by Focused.

Focused Logo

Use with mlx

pip install mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/Sky-T1-32B-Preview-8bit")

prompt = "hello"

if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True
    )

response = generate(model, tokenizer, prompt=prompt, verbose=True)

Focused is a technology company at the forefront of AI-driven development, empowering organizations to unlock the full potential of artificial intelligence. From integrating innovative models into existing systems to building scalable, modern AI infrastructures, we specialize in delivering tailored, incremental solutions that meet you where you are. Curious how we can help with your AI next project? Get in Touch

Focused Logo

Downloads last month
92
Safetensors
Model size
9.22B params
Tensor type
FP16
·
U32
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for mlx-community/Sky-T1-32B-Preview-8bit

Base model

Qwen/Qwen2.5-32B
Quantized
(20)
this model

Datasets used to train mlx-community/Sky-T1-32B-Preview-8bit