arxiv:2604.14683
Qianqian Xie
mistletoe111
AI & ML interests
None yet
Recent Activity
authored a paper about 1 hour ago
MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues
authored a paper about 1 hour ago
IF-VidCap: Can Video Caption Models Follow Instructions?
authored a paper about 2 hours ago
DR$^{3}$-Eval: Towards Realistic and Reproducible Deep Research Evaluation