Article: Illustrating Reinforcement Learning from Human Feedback (RLHF) By natolambert and 3 others • Dec 9, 2022 • 287
Running 2.72k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
LLM Reasoning Papers Collection improve reasoning capabilities of LLMs • 45 items • Updated Feb 18 • 5
Running on CPU Upgrade 9.14k Kolors Virtual Try-On 👕 Try on clothes virtually by uploading images