Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Paper • 2508.09834 • Published 8 days ago • 41
BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining Paper • 2508.10975 • Published 6 days ago • 48
PRELUDE: A Benchmark Designed to Require Global Comprehension and Reasoning over Long Contexts Paper • 2508.09848 • Published 7 days ago • 63
Matrix-3D: Omnidirectional Explorable 3D World Generation Paper • 2508.08086 • Published 9 days ago • 67
HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches Paper • 2508.08088 • Published 9 days ago • 28
Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL Paper • 2508.07976 • Published 10 days ago • 45
view article Article TextQuests: How Good are LLMs at Text-Based Video Games? By justinphan3110 and 1 other • 9 days ago • 24
Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning Paper • 2508.03501 • Published 15 days ago • 52
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published 19 days ago • 215
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward Paper • 2508.03686 • Published 15 days ago • 32
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 16 days ago • 467
Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report Paper • 2508.01059 • Published 19 days ago • 32
Persona Vectors: Monitoring and Controlling Character Traits in Language Models Paper • 2507.21509 • Published 23 days ago • 29
GLiNER2: An Efficient Multi-Task Information Extraction System with Schema-Driven Interface Paper • 2507.18546 • Published 27 days ago • 18
Common Pile v0.1 Raw Data Collection 8TB of public domain and openly licensed text • 30 items • Updated 6 days ago • 18
Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning Paper • 2507.14137 • Published Jul 18 • 33
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published Jul 17 • 245