NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 6 items • Updated 7 days ago • 114
view article Article Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models 23 days ago • 104
Nemotron-Personas Collection A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions. • 3 items • Updated 15 days ago • 14
view article Article Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B Aug 18, 2025 • 31
view article Article Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B Jun 10, 2025 • 7
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning Paper • 2504.11409 • Published Apr 15, 2025 • 9
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024 • 46
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models Paper • 2409.17481 • Published Sep 26, 2024 • 47
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 15 days ago • 62
LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21, 2024 • 58