view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 12 days ago • 86
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand 26 days ago • 63
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 29 days ago • 258
Inference Optimized Checkpoints (with Model Optimizer) Collection A collection of generative models quantized and optimized for inference with Model Optimizer. • 46 items • Updated 6 days ago • 66
Tri Series Collection Introducing our new series of models: Tri-7B, Tri-21B, and Tri-70B-preview-SFT • 10 items • Updated Sep 10 • 8
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 393
H-Net Collection The family of hierarchical networks (H-Nets) from https://arxiv.org/abs/2507.07955 • 8 items • Updated Jul 11 • 20
view article Article Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance May 21 • 38