Unsloth AI

Team

company

Verified

https://unsloth.ai

UnslothAI

unslothai

Activity Feed

AI & ML interests

Open Source AI 💚

Recent Activity

danielhanchen new activity 1 day ago

unsloth/GLM-4.7-GGUF:Original FP8 Weights

danielhanchen new activity 3 days ago

unsloth/Qwen-Image-2512-GGUF:colab notebook link to test the model

danielhanchen new activity 3 days ago

unsloth/Qwen-Image-2512-GGUF:New tool to benchmark GGUF files

View all activity

danielhanchen

in unsloth/GLM-4.7-GGUF 1 day ago

Original FP8 Weights

#9 opened 8 days ago by

Ano-Nimus

danielhanchen

in unsloth/Qwen-Image-2512-GGUF 3 days ago

colab notebook link to test the model

#6 opened 3 days ago by

moon005

New tool to benchmark GGUF files

#7 opened 3 days ago by

Nerdsking

danielhanchen

in unsloth/MiniMax-M2.1-GGUF 4 days ago

Original Precision GGUF?

👍 1

#6 opened 8 days ago by

Ano-Nimus

danielhanchen

in unsloth/Qwen-Image-2512-GGUF 4 days ago

Qwen Image was train at 1328x1328

#4 opened 4 days ago by

tech77

danielhanchen

updated 5 models 4 days ago

danielhanchen

posted an update 5 days ago

Post

494

Run Qwen-Image-2512, the new SOTA text-to-image model! 💜

It's the top performing open diffusion model and has more realistic + accurate images/text.

Run locally with 14GB RAM via our Dynamic GGUF: unsloth/Qwen-Image-2512-GGUF

Guide: https://unsloth.ai/docs/models/qwen-image-2512

2 replies

danielhanchen

posted an update 13 days ago

Post

3805

You can now run GLM-4.7, the new 355B parameter SOTA model on your local device (128GB RAM).✨

The model achieves SOTA performance on coding, agentic and chat benchmarks.

GGUF: unsloth/GLM-4.7-GGUF
Guide: https://docs.unsloth.ai/models/glm-4.7

3 replies

danielhanchen

posted an update 18 days ago

Post

2281

Google releases FunctionGemma, a new 270M parameter model that runs on just 0.5 GB RAM.✨

Built for tool-calling, run locally on your phone at 50+ tokens/s, or fine-tune with Unsloth & deploy to your phone.

GGUF: unsloth/functiongemma-270m-it-GGUF
Docs + Notebook: https://docs.unsloth.ai/models/functiongemma

2 replies

danielhanchen

posted an update 21 days ago

Post

5320

NVIDIA releases Nemotron 3 Nano, a new 30B hybrid reasoning model! 🔥

Has 1M context window & best in class performance for SWE-Bench, reasoning & chat. Run the MoE model locally with 24GB RAM.

GGUF: unsloth/Nemotron-3-Nano-30B-A3B-GGUF
💚 Step-by-step Guide: https://docs.unsloth.ai/models/nemotron-3

1 reply

danielhanchen

posted an update 25 days ago

Post

2049

Mistral's new SOTA coding models Devstral 2 can now be Run locally! (25GB RAM) 🐱
We fixed the chat template, so performance should be much better now!
24B: unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF
123B: unsloth/Devstral-2-123B-Instruct-2512-GGUF

🧡Step-by-step Guide: https://docs.unsloth.ai/models/devstral-2

danielhanchen

posted an update about 1 month ago

Post

3703

Mistral's new Ministral 3 models can now be Run & Fine-tuned locally! (16GB RAM)
Ministral 3 have vision support and the best-in-class performance for their sizes.
14B Instruct GGUF: unsloth/Ministral-3-14B-Instruct-2512-GGUF
14B Reasoning GGUF: unsloth/Ministral-3-14B-Reasoning-2512-GGUF

🐱 Step-by-step Guide: https://docs.unsloth.ai/new/ministral-3
All GGUFs, BnB, FP8 etc. variants uploads: https://huggingface.co/collections/unsloth/ministral-3

3 replies

danielhanchen

posted an update about 1 month ago

Post

8481

Qwen3-Next can now be Run locally! (30GB RAM)
Instruct GGUF: unsloth/Qwen3-Next-80B-A3B-Instruct-GGUF

The models come in Thinking and Instruct versions and utilize a new architecture, allowing it to have ~10x faster inference than Qwen32B.
💜 Step-by-step Guide: https://docs.unsloth.ai/models/qwen3-next

Thinking GGUF: unsloth/Qwen3-Next-80B-A3B-Thinking-GGUF

danielhanchen

posted an update about 2 months ago

Post

4385

You can now run Kimi K2 Thinking locally with our Dynamic 1-bit GGUFs: unsloth/Kimi-K2-Thinking-GGUF

We shrank the 1T model to 245GB (-62%) & retained ~85% of accuracy on Aider Polyglot. Run on >247GB RAM for fast inference.

We also collaborated with the Moonshot AI Kimi team on a system prompt fix! 🥰

Guide + fix details: https://docs.unsloth.ai/models/kimi-k2-thinking-how-to-run-locally

danielhanchen

posted an update 5 months ago

Post

6586

Run DeepSeek-V3.1 locally on 170GB RAM with Dynamic 1-bit GGUFs!🐋
GGUFs: unsloth/DeepSeek-V3.1-GGUF

The 715GB model gets reduced to 170GB (-80% size) by smartly quantizing layers.

The 1-bit GGUF passes all our code tests & we fixed the chat template for llama.cpp supported backends.

Guide: https://docs.unsloth.ai/basics/deepseek-v3.1

danielhanchen

posted an update 5 months ago

Post

5649

Run OpenAI's new gpt-oss models locally with Unsloth GGUFs! 🔥🦥
20b GGUF: unsloth/gpt-oss-20b-GGUF
120b GGUF: unsloth/gpt-oss-120b-GGUF

Model will run on 14GB RAM for 20b and 66GB for 120b.

2 replies

AI & ML interests

Recent Activity

Team members 2

unsloth's activity

Original FP8 Weights

colab notebook link to test the model

New tool to benchmark GGUF files

Original Precision GGUF?

Qwen Image was train at 1328x1328