Halley AI

company

Verified

https://halleyai.ai/

AI & ML interests

Text Generation & Chat Assistants; Model Compression & Quantization (Q4/Q6/Q8, gs32); Inference & Serving (on-prem, low-latency); RAG / Retrieval; Agents & Tool Use; Distillation / LoRA / Fine-tuning

Recent Activity

sebastavar updated a Space 2 days ago

halley-ai/README

sebastavar updated a model 2 days ago

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-5bit-gs32

sebastavar updated a model 2 days ago

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-6bit-gs64

View all activity

halley-ai 's models 9

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-5bit-gs32

Text Generation • 80B • Updated 2 days ago • 46 • 1

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-6bit-gs64

Text Generation • 80B • Updated 2 days ago • 31 • 1

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-4bit-gs64

Text Generation • 80B • Updated 2 days ago • 35 • 1

halley-ai/gpt-oss-120b-MLX-bf16

Text Generation • 117B • Updated 13 days ago • 293 • 1

halley-ai/gpt-oss-120b-MLX-8bit-gs32

Text Generation • 117B • Updated 13 days ago • 215 • 1

halley-ai/gpt-oss-120b-MLX-6bit-gs64

Text Generation • 117B • Updated 13 days ago • 168 • 1

halley-ai/gpt-oss-20b-MLX-5bit-gs32

Text Generation • 21B • Updated 13 days ago • 217 • 1

halley-ai/gpt-oss-20b-MLX-6bit-gs32

Text Generation • 21B • Updated Aug 18 • 150 • 1

halley-ai/gpt-oss-20b-MLX-4bit-gs32

Text Generation • 21B • Updated Aug 18 • 186 • 1