Chatbot Arena Leaderboard
Display chatbot leaderboard and statistics
Track, rank and evaluate open LLMs and chatbots
Embedding Leaderboard
Request evaluation for new speech models
Explore LLM performance across hardware
Submit code models for evaluation on benchmarks
Generate animated avatars from images
View and submit LLM evaluations
View and submit machine learning model evaluations
Analyze images to detect and label objects
Evaluate LLM cybersecurity risks
Display model benchmark results
View LLM Performance Leaderboard
Explore benchmark results for QA and long doc models
VLMEvalKit Evaluation Results Collection
Explore and analyze RewardBench leaderboard data
Explore and analyze code evaluation data
Display and filter multimodal model leaderboard results
Access language translation services through an embedded interface
Visualize Open vs. Proprietary LLM Progress
Vote on AI responses to rank models
Demo of the new, massively multilingual leaderboard