Benchmark Contamination Monitoring System 📉 View and submit benchmarks for contamination analysis
Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index Paper • 2506.12229 • Published 13 days ago • 3 • 2
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Paper • 2505.24864 • Published 27 days ago • 131
massive-serve Collection One command to download and serve a datastore---that's it 😎. https://github.com/RulinShao/massive-serve • 8 items • Updated 20 days ago • 1
DataDecide: How to Predict Best Pretraining Data with Small Experiments Paper • 2504.11393 • Published Apr 15 • 18
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens Paper • 2504.07096 • Published Apr 9 • 74