view article Article Ulysses Sequence Parallelism: Training with Million-Token Contexts 6 days ago • 20
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 5 days ago • 49
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 12 items • Updated 4 days ago • 197
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated about 16 hours ago • 13.1k • 192