arxiv:2501.08326
Min-Hung Chen
cmhungsteve
AI & ML interests
Multimodal AI, Transfer Learning, Unsupervised Learning, Video Understanding, Vision Transformer, Computer Vision, Deep Learning
Recent Activity
authored
a paper
3 days ago
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token
Marks
upvoted
a
paper
3 days ago
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token
Marks
commented on
a paper
3 days ago
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token
Marks
Organizations
Papers
25
models
None public yet
datasets
None public yet