LayoutLM - a microsoft Collection

microsoft 's Collections

MediPhi

Phi-4

Phi-3

Phi-1

Controllable Safety Alignment

BitNet

TAPEX

Table Transformer

Orca

UDOP

GIT

IFMs

LayoutLM

updated May 1

The LayoutLM series are Transformer encoders useful for document AI tasks such as invoice parsing, document image classification and DocVQA.

microsoft/layoutlmv3-base

0.1B • Updated Apr 10, 2024 • 1.22M • 426

Note Currently the best LayoutLM model.
microsoft/layoutlmv2-base-uncased

Updated Sep 16, 2022 • 439k • 67
microsoft/layoutlm-base-uncased

0.1B • Updated Apr 16, 2024 • 351k • 58
microsoft/layoutxlm-base

Updated Sep 16, 2022 • 8.15k • 72

Note A multilingual variant trained on 100 languages.
impira/layoutlm-document-qa

Document Question Answering • 0.1B • Updated Mar 18, 2023 • 18.5k • 1.12k

Note A LayoutLM (v1) model fine-tuned to perform question answering over documents (DocVQA).
nielsr/layoutlmv3-finetuned-funsd

Token Classification • 0.1B • Updated Sep 16, 2023 • 3.45k • • 29

Note A LayoutLMv3 model fine-tuned on the FUNSD dataset, a benchmark for document parsing.