LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Paper • 2501.03895 • Published 11 days ago • 48
Multimodal Models Collection Multimodal models with leading performance. • 17 items • Updated 1 day ago • 28
Models Trained on Ultra Series Collection The collection of open-source models that adopt Ultra Series datasets for training • 22 items • Updated Oct 22, 2024 • 4