BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset Paper β’ 2505.09568 β’ Published May 14 β’ 95
BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions Paper β’ 2411.07461 β’ Published Nov 12, 2024 β’ 24
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper β’ 2408.08872 β’ Published Aug 16, 2024 β’ 101
π MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" β’ 13 items β’ Updated Jul 24, 2024 β’ 62