Vision, Language and Reading

non-profit
https://www.vlr.ai/

AI & ML interests

Multimodal AI, Document Understanding, Reading Systems.

Recent Activity

emanuelevivoli authored a paper 26 days ago: CoSMo: A Multimodal Transformer for Page Stream Segmentation in Comic Books
emanuelevivoli authored a paper 26 days ago: Multimodal Transformer for Comics Text-Cloze
Llabres updated a dataset about 1 month ago: VLR-CVC/ComicsPAP

Papers

ComicsPAP: understanding comic strips by picking the correct panel

One missing piece in Vision and Language: A Survey on Comics Understanding


Members: Emanuele Vivoli, Tomás Ockier Poblet, Artemis Llabrés, Eric López, Khanh Nguyen, Mohamed Ali Souibgui, Dimosthenis Karatzas, Serra

Submitted by Emanuele Vivoli

One missing piece in Vision and Language: A Survey on Comics Understanding
