LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning Paper • 2503.04812 • Published 9 days ago • 12
AVG-LLaVA: A Large Multimodal Model with Adaptive Visual Granularity Paper • 2410.02745 • Published Sep 20, 2024
Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training Paper • 2410.04439 • Published Oct 6, 2024
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning Paper • 2503.04812 • Published 9 days ago • 12
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning Paper • 2503.04812 • Published 9 days ago • 12 • 3
LLaVE Collection LLaVE is a series of large language and vision embedding models trained on a variety of multimodal embedding datasets • 4 items • Updated 3 days ago • 8
LLaVE Collection LLaVE is a series of large language and vision embedding models trained on a variety of multimodal embedding datasets • 4 items • Updated 3 days ago • 8
LLaVE Collection LLaVE is a series of large language and vision embedding models trained on a variety of multimodal embedding datasets • 4 items • Updated 3 days ago • 8