nex-agi/agent-sft
Preview
•
Updated
•
158
•
105
AGI, Nex
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping