Datasets and models focused on long video-language understanding.
Jingyang Lin
jylins
AI & ML interests
vision-and-language
Recent Activity
updated a dataset 12 days ago: estsafda/test
published a dataset 13 days ago: estsafda/test
upvoted a paper 19 days ago: VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning