Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lbourdois 's Collections
French packs
French Translations
FAT5
Breton packs
French NER
French QA
French prompts datasets
French embedding datasets
French VQA datasets
French caption datasets
French OCR datasets
French retriever datasets
French table-to-text datasets
French audio datasets (pretraining)

French OCR datasets

updated 5 days ago

Datasets I cleaned with an image, a prompt question (like "transcribe the text in this image") and an answer. Can be used to train VLMs.

Upvote
-

  • lbourdois/OCR-neulab-PangeaInstruct-OCR-clean

    Viewer • Updated 29 days ago • 30k • 259

  • lbourdois/OCR-liboaccn-OPUS-MIT-5M-clean

    Viewer • Updated 29 days ago • 530k • 70
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs