ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding Paper • 2603.27064 • Published about 1 month ago • 27
Article Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents 27 days ago • 34
Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs Paper • 2603.16932 • Published Mar 14 • 87
NLE: Non-autoregressive LLM-based ASR by Transcript Editing Paper • 2603.08397 • Published Mar 9 • 22
ibm-granite/granite-4.0-1b-speech Automatic Speech Recognition • 2B • Updated 25 days ago • 95.1k • 233
Granite 4.0 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 11 items • Updated 25 days ago • 218
Granite 4.0 Nano Language Models Collection Ultra-compact language models designed for the edge and on-device deployment. • 9 items • Updated 26 days ago • 100
Advancing Speech Understanding in Speech-Aware Language Models with GRPO Paper • 2509.16990 • Published Sep 21, 2025 • 22