PleIAs

Team

company

Activity Feed

AI & ML interests

Open Science LLMs

Recent Activity

Pclanglais updated a model 5 days ago

Pclanglais/POntAvignon-4b

Pclanglais published a model 5 days ago

Pclanglais/POntAvignon-4b

Pclanglais updated a model 10 days ago

JZSG/baguette-funders-600m

View all activity

Organization Card

Community About org cards

PleIAs is a French private AI Lab training the next generation of Language Models for document processing.

PleIAs is committed to open science and has coordinated the release of some of the largest open corpus for pre-training.

For more information, visit our website : https://pleias.fr/

Collections 10

View 10 collections

spaces 7

baguettotron_demo

📜

Vintage OCR Corrector (GPU)

📜

Correct OCR errors in your text

Vintage OCR Corrector (CPU)

📜

Correct OCR errors in text

Finance Commons Explorer

💻

Browse finance datasets on Hugging Face

Reversed-Zotero

📜

View 7 Spaces

models 29

datasets 57

PleIAs/French-Science-Commons

Viewer • Updated 25 days ago • 42.6M • 2.04k • 18

PleIAs/BSF_Redline

Viewer • Updated Feb 27 • 1.05M • 48

PleIAs/common_corpus

Viewer • Updated Feb 19 • 69.9k • 242k • 389

PleIAs/Japanese-PD

Viewer • Updated Feb 16 • 1.38M • 172 • 1

PleIAs/Arabic-PD

Viewer • Updated Feb 16 • 221k • 123

PleIAs/verse-wikisource

Preview • Updated Nov 11, 2025 • 38 • 3

PleIAs/SYNTH

Viewer • Updated Nov 11, 2025 • 68M • 42.8k • 260

PleIAs/Youtube-Commons-Audio-Sample-1000

Updated Oct 11, 2025 • 10

PleIAs/gpt-oss20b-samples-dedup

Viewer • Updated Aug 9, 2025 • 179k • 93 • 5

PleIAs/Post-OCR-Correction

Viewer • Updated Jul 7, 2025 • 50.4k • 718 • 135

View 57 datasets