zhaoyuzhong
callsys
ยท
AI & ML interests
computer vision
Recent Activity
upvoted
a
paper
about 2 months ago
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand
Audio-Visual Information?
upvoted
a
paper
about 2 months ago
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
new activity
5 months ago
microsoft/kosmos-2.5:Upload receipt_00008.png
Organizations
None yet
models
None public yet
datasets
None public yet