view article Article The N Implementation Details of RLHF with PPO By vwxyzjn and 2 others • Oct 24, 2023 • 67
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • Aug 5 • 490
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) By natolambert and 3 others • Dec 9, 2022 • 337
view article Article Mixture of Experts Explained By osanseviero and 5 others • Dec 11, 2023 • 882
Running 3.16k 3.16k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 647
view article Article Open LLM Leaderboard: DROP deep dive By clefourrier and 4 others • Dec 1, 2023 • 9
view article Article What's going on with the Open LLM Leaderboard? By clefourrier and 3 others • Jun 23, 2023 • 43
arun-AiBharat/BookCorpus_Chunked_1K_Tokens_GPT2_Pretraining Viewer • Updated Sep 30, 2024 • 1.06M • 1