Xi Yang
xiyang99
7 followers · 1 following
AI & ML interests
None yet
Recent Activity
Authored a paper 13 days ago: CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning
Authored a paper 13 days ago: HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation
Authored a paper 13 days ago: MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding
Organizations
Articles
1
Article
33
Letting Large Models Debate: The First Multilingual LLM Debate Competition
Papers
13
arxiv:2509.17177
arxiv:2508.11252
arxiv:2508.10015
arxiv:2508.02178
models
0
None public yet
datasets
0
None public yet