Building and better understanding vision-language models: insights and future directions • Paper • 2408.12637 • Published Aug 22, 2024
4M Models • Collection • Multimodal models from https://4m.epfl.ch/ • 14 items • Updated Jun 14, 2024