Running 1.23k 1.23k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 15 days ago • 113
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 Paper • 2502.03544 • Published 17 days ago • 42 • 5
Great Models Think Alike and this Undermines AI Oversight Paper • 2502.04313 • Published 16 days ago • 29
Great Models Think Alike and this Undermines AI Oversight Paper • 2502.04313 • Published 16 days ago • 29
Great Models Think Alike and this Undermines AI Oversight Paper • 2502.04313 • Published 16 days ago • 29 • 2
Representation Engineering: A Top-Down Approach to AI Transparency Paper • 2310.01405 • Published Oct 2, 2023 • 5
The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning Paper • 2403.03218 • Published Mar 5, 2024 • 1