Collections

Discover the best community collections!

Collections including paper arxiv:2412.04432
video LM
Collection by 21 days ago
Video
Collection by 27 days ago
Unified MLLM
Unified model that generate Text, Image, Video
Cognition
Perception and abstraction. Each modality is tokenized and embedded into vectors for model to comprehend.
VisionLM
Collection by 8 days ago
video
Collection by 6 days ago
daily papers
Collection by 11 days ago