Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Paper • 2508.09834 • Published 8 days ago • 41
BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining Paper • 2508.10975 • Published 6 days ago • 48
PRELUDE: A Benchmark Designed to Require Global Comprehension and Reasoning over Long Contexts Paper • 2508.09848 • Published 7 days ago • 63
Matrix-3D: Omnidirectional Explorable 3D World Generation Paper • 2508.08086 • Published 9 days ago • 67
HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches Paper • 2508.08088 • Published 9 days ago • 28
Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL Paper • 2508.07976 • Published 10 days ago • 45
view article Article TextQuests: How Good are LLMs at Text-Based Video Games? By justinphan3110 and 1 other • 9 days ago • 24
Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning Paper • 2508.03501 • Published 15 days ago • 52
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published 19 days ago • 215
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward Paper • 2508.03686 • Published 15 days ago • 32
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 16 days ago • 467