- Open VLM Leaderboard (🌎 952): VLMEvalKit evaluation results collection
- Open VLM Video Leaderboard (🌎 129): VLMEvalKit evaluation results on video understanding benchmarks
- Open LMM Reasoning Leaderboard (🥇 44): a leaderboard that demonstrates LMM reasoning capabilities
- MMBench Leaderboard (🚀 24): explore MMBench Leaderboard data
Papers
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM