VLMs are Blind!
Vision language models are blind Paper • 2407.06581 • Published Jul 9, 2024 • 84
XAI/vlmsareblind Viewer • Updated Nov 22, 2024 • 8.02k • 666 • 28
Runtime error Agents 4 VLMsAreBlind ResultsReview 📚 4 Review model results on visual tasks

ArXiv QA
Automated ArXiv Question Answering with LLMs
Running 16 ArXiv Daily Papers 📚 16 Browse daily arXiv paper summaries with filters
taesiri/ArXiv Viewer • Updated Nov 12, 2025 • 19.2k • 112
Paused Agents 63 Claude Reads Arxiv 📖 63
taesiri/arxiv_qa Viewer • Updated Apr 15, 2024 • 211k • 511 • 138
taesiri/BugsBunny-LLama-3.2-11B-Vision-BaseCaptioner-XLarge-FullModel Image-Text-to-Text • 11B • Updated Nov 26, 2024 • 6
taesiri/BugsBunny-LLama-3.2-11B-Vision-BaseCaptioner-Medium-FullModel Image-Text-to-Text • 11B • Updated Nov 16, 2024