Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Vision, Language and Reading

non-profit
https://www.vlr.ai/
Activity Feed

AI & ML interests

Multimodal AI, Document Understanding, Reading Systems.

Recent Activity

emanuelevivoli  authored a paper 21 days ago
CoSMo: A Multimodal Transformer for Page Stream Segmentation in Comic Books
emanuelevivoli  authored a paper 21 days ago
Multimodal Transformer for Comics Text-Cloze
Llabres  updated a dataset about 1 month ago
VLR-CVC/ComicsPAP
View all activity

Papers

ComicsPAP: understanding comic strips by picking the correct panel

One missing piece in Vision and Language: A Survey on Comics Understanding

View all Papers

Emanuele Vivoli's profile picture Tomás Ockier Poblet's profile picture Artemis Llabrés's profile picture Eric López's profile picture Khanh Nguyen's profile picture Mohamed Ali Souibgui's profile picture Dimosthenis Karatzas's profile picture Serra's profile picture

VLR-CVC 's models 2

VLR-CVC/Qwen2.5-VL-7B-Instruct-lora-ComicsPAP

Updated Apr 9

VLR-CVC/Qwen2.5-VL-3B-Instruct-lora-ComicsPAP

Updated Apr 9 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs