IBM Granite

Enterprise

company

IBM-Granite

Activity Feed

AI & ML interests

LLMs for language and code + Time series and geospatial foundation models

Recent Activity

ibibrahim updated a collection about 7 hours ago

Granite Experiments

pawasthy updated a model 1 day ago

ibm-granite/granite-embedding-english-r2

pawasthy updated a model 1 day ago

ibm-granite/granite-embedding-small-english-r2

View all activity

Papers

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

View all Papers

Articles

Granite 4.0 Nano: Just how small can you go?

Oct 28, 2025

•

121

ibibrahim

updated a collection about 7 hours ago

Granite Experiments

Collection

Experimental projects under consideration for the Granite family. • 21 items • Updated about 7 hours ago • 15

pawasthy

updated 2 models 1 day ago

ibm-granite/granite-embedding-english-r2

Feature Extraction • 0.1B • Updated 1 day ago • 54.2k • 75

ibm-granite/granite-embedding-small-english-r2

frreiss

in ibm-granite/granite-lib-rag-r1.0 2 days ago

Add more aLoRA files

#5 opened 2 days ago by

frreiss

Models and configs for Ollama backend

#4 opened 13 days ago by

kndtran

gabegoodhart

in ibm-granite/granite-4.0-h-tiny 2 days ago

Wow I am not sure what to say about this result? I don't think I did anything 2 strange to cause this, and I swear I was not messing around with it or trying to stage a bug.

#7 opened about 2 months ago by

Culling

gabegoodhart

in ibm-granite/granite-docling-258M 2 days ago

no simple compatible

#44 opened 5 days ago by

kalle07

pcuenq

posted an update 2 days ago

Post

2323

👉 What happened in AI in 2025? 👈

We prepared the 2025 version of the HF AI Timeline Grid, highlighting open vs API-based model releases, and allowing you to browse and filter by access, modality, and release type!

Play with it here:
2025-ai-timeline/2025-ai-timeline

Here's my personal quarterly TL;DR:

1️⃣ Q1 — Learning to Reason
Deepseek not only releases a top-notch reasoning model, but shows how to train them and compete with closed frontier models. OpenAI debuts Deep Research.

Significant milestones: DeepSeek R1 & R1-Zero, Qwen 2.5 VL, OpenAI Deep Research, Gemini 2.5 Pro (experimental)

2️⃣ Q2 — Multimodality and Coding
More LLMs embrace multimodality by default, and there's a surge in coding agents. Strong vision, audio, and generative models emerge.

Significant milestones: Llama 4, Qwen 3, Imagen 4, OpenAI Codex, Google Jules, Claude 4

3️⃣ Q3 — "Gold" rush, OpenAI opens up, the community goes bananas
Flagship models get gold in Math olympiads and hard benchmarks. OpenAI releases strong open source models and Google releases the much anticipated nano-banana for image generation and editing. Agentic workflows become commonplace.

Significant milestones: Gemini and OpenAI IMO Gold, gpt-oss, Gemini 2.5 Flash Image, Grok 4, Claude Sonnet 4.5

4️⃣ Q4 — Mistral returns, leaderboard hill-climbing
Mistral is back with updated model families. All labs release impressive models to wrap up the year!

Significant milestones: Claude Opus 4.5, DeepSeek Math V2, FLUX 2, GPT 5.1, Kimi K2 Thinking, Nano Banana Pro, GLM 4.7, Gemini 3, Mistral 3, MiniMax M2.1 🤯

Credits
🙏 NHLOCAL for the source data https://github.com/NHLOCAL/AiTimeline

🫡 @reach-vb for the original idea, design and recipe

🙌 @ariG23498 and yours truly for compiling and verifying the 2025 edition

🥳 Here's to 2026, wishing it becomes the best year ever for open releases and on-device-first use-cases! 🥂

1 reply

danielhanchen

posted an update 7 days ago

Post

592

Run Qwen-Image-2512, the new SOTA text-to-image model! 💜

It's the top performing open diffusion model and has more realistic + accurate images/text.

Run locally with 14GB RAM via our Dynamic GGUF: unsloth/Qwen-Image-2512-GGUF

Guide: https://unsloth.ai/docs/models/qwen-image-2512

2 replies

danielhanchen

posted an update 15 days ago

Post

3867

You can now run GLM-4.7, the new 355B parameter SOTA model on your local device (128GB RAM).✨

The model achieves SOTA performance on coding, agentic and chat benchmarks.

GGUF: unsloth/GLM-4.7-GGUF
Guide: https://docs.unsloth.ai/models/glm-4.7

3 replies

ibibrahim

published a model 16 days ago

ibm-granite/granite-guardian-3.2-5b-lora-factuality-correction

Text Generation • Updated about 1 month ago

frreiss

in ibm-granite/granite-lib-rag-r1.0 19 days ago

Add Granite 3.3 8b LoRAs

#3 opened 19 days ago by

frreiss

danielhanchen

posted an update 20 days ago

Post

2325

Google releases FunctionGemma, a new 270M parameter model that runs on just 0.5 GB RAM.✨

Built for tool-calling, run locally on your phone at 50+ tokens/s, or fine-tune with Unsloth & deploy to your phone.

GGUF: unsloth/functiongemma-270m-it-GGUF
Docs + Notebook: https://docs.unsloth.ai/models/functiongemma

2 replies

gabegoodhart

in ibm-granite/granite-4.0-1b-GGUF 20 days ago

F16 broken

#1 opened 21 days ago by

ramendik

gabegoodhart

updated a model 20 days ago

ibm-granite/granite-4.0-1b-GGUF

2B • Updated 20 days ago • 1.42k • 2

abrooks9944

in ibm-granite/granite-speech-3.3-8b 21 days ago

Flash Attention v2 doesnt work

#20 opened 28 days ago by

abcdefghijklmnop52

frreiss

updated a model 22 days ago

ibm-granite/granite-lib-rag-r1.0

14.4M • Updated 2 days ago • 245 • 2

danielhanchen

posted an update 23 days ago

Post

5363

NVIDIA releases Nemotron 3 Nano, a new 30B hybrid reasoning model! 🔥

Has 1M context window & best in class performance for SWE-Bench, reasoning & chat. Run the MoE model locally with 24GB RAM.

GGUF: unsloth/Nemotron-3-Nano-30B-A3B-GGUF
💚 Step-by-step Guide: https://docs.unsloth.ai/models/nemotron-3

1 reply

frreiss

published a model 27 days ago

ibm-granite/granite-lib-rag-r1.0

14.4M • Updated 2 days ago • 245 • 2

danielhanchen

posted an update 27 days ago

Post

2086

Mistral's new SOTA coding models Devstral 2 can now be Run locally! (25GB RAM) 🐱
We fixed the chat template, so performance should be much better now!
24B: unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF
123B: unsloth/Devstral-2-123B-Instruct-2512-GGUF

🧡Step-by-step Guide: https://docs.unsloth.ai/models/devstral-2

AI & ML interests

Recent Activity

Papers

Articles

Granite 4.0 Nano: Just how small can you go?

Team members 103

ibm-granite's activity

Add more aLoRA files

Models and configs for Ollama backend

Wow I am not sure what to say about this result? I don't think I did anything 2 strange to cause this, and I swear I was not messing around with it or trying to stage a bug.

no simple compatible

Add Granite 3.3 8b LoRAs

F16 broken

Flash Attention v2 doesnt work