naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B Text Generation • 4B • Updated 17 days ago • 451k • 206
How Does Vision-Language Adaptation Impact the Safety of Vision Language Models? Paper • 2410.07571 • Published Oct 10, 2024 • 2
Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation Paper • 2401.06591 • Published Jan 12, 2024 • 4
SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap Paper • 2309.12382 • Published Sep 21, 2023
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis Paper • 1904.01906 • Published Apr 3, 2019
Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models Paper • 2305.15080 • Published May 24, 2023