Optimized Table Tokenization for Table Structure Recognition Paper โข 2305.03393 โข Published May 5, 2023
KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents Paper โข 2405.00505 โข Published May 1, 2024
TableFormer: Table Structure Understanding with Transformers Paper โข 2203.01017 โข Published Mar 2, 2022
Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion Paper โข 2501.17887 โข Published Jan 27 โข 1
Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence Paper โข 2502.09927 โข Published Feb 14
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper โข 2503.11576 โข Published Mar 14 โข 117