The Ultra-Scale Playbook • Running • 3.67k • The ultimate guide to training LLM on large GPU Clusters
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds Paper • 2511.08892 • Published Nov 12, 2025 • 209
The Smol Training Playbook • Running on CPU Upgrade • Featured • 2.95k • The secrets to building world-class LLMs
The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation Paper • 2510.23393 • Published Oct 27, 2025 • 21
Building the Open Agent Ecosystem Together: Introducing OpenEnv Article • Oct 23, 2025 • 148
The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management Paper • 2508.21433 • Published Aug 29, 2025 • 7
On Pretraining for Project-Level Code Completion Paper • 2510.13697 • Published Oct 15, 2025 • 7
Repository-Level Pre-Trained OpenCoder Collection • All the checkpoints from Table 3 of the paper “On Pretraining for Project-Level Code Completion.” • 33 items • Updated Oct 17, 2025 • 3
PIPer: On-Device Environment Setup via Online Reinforcement Learning Paper • 2509.25455 • Published Sep 29, 2025 • 38