view article Article Prefill and Decode for Concurrent Requests - Optimizing LLM Performance Apr 16, 2025 • 58
Reinforcement Learning for Self-Improving Agent with Skill Library Paper • 2512.17102 • Published 17 days ago • 30
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 17 days ago • 93
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 3 days ago • 41
NeMo Gym Collection Collection of RL verifiable data for NeMo Gym • 13 items • Updated 12 days ago • 32
ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models Paper • 2512.07843 • Published Nov 24, 2025 • 21
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction Paper • 2512.04987 • Published about 1 month ago • 76
Continual Learning, Not Training: Online Adaptation For Agents Paper • 2511.01093 • Published Nov 2, 2025 • 1
view article Article Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes Oct 22, 2025 • 11
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices Paper • 2512.01374 • Published Dec 1, 2025 • 95
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 264
Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs Paper • 2511.16664 • Published Nov 20, 2025 • 26
view article Article Building the Open Agent Ecosystem Together: Introducing OpenEnv +8 Oct 23, 2025 • 139