ATLAS Benchmark
ATLAS for Frontier Scientific Benchmark
None defined yet.
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM
ATLAS for Frontier Scientific Benchmark
A Gallery of Generation Results on RISEBench
A Leaderboard for LMM spatial understanding capabilities
VLMEvalKit Subjectivce Benchmark Results
Compass Academic Leaderboard Full Version
A Leaderboard that demonstrates LMM reasoning capabilities
Compass Academic Leaderboard
VLMEvalKit Evaluation Results Collection
Explore MMBench Leaderboard data
VLMEvalKit Eval Results in video understanding benchmark
CompassJudger Subjective Evaluation Learderboard
JudgerBench Leaderboard
Display a web page
Evaluate code snippets across multiple languages
Display CompassArena platform
Display a web page