Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring Paper • 2403.09333 • Published Mar 14, 2024 • 15 • 3
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models Paper • 2404.07973 • Published Apr 11, 2024 • 31 • 3