Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models • Paper 2506.06395 • Published 8 days ago
The Ultra-Scale Playbook 🌌 • Space • The ultimate guide to training LLMs on large GPU clusters
Open LLM Leaderboard best models ❤️🔥 • Collection • A list, uploaded daily, of the models with the best evaluations on the LLM leaderboard • 65 items • Updated Mar 20