VLR-CVC/Qwen2.5-VL-7B-Instruct-lora-ComicsPAP
Multimodal AI, Document Understanding, Reading Systems.
ComicsPAP: understanding comic strips by picking the correct panel
One missing piece in Vision and Language: A Survey on Comics Understanding