Optimized Table Tokenization for Table Structure Recognition Paper • 2305.03393 • Published May 5, 2023
KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents Paper • 2405.00505 • Published May 1, 2024
TableFormer: Table Structure Understanding with Transformers Paper • 2203.01017 • Published Mar 2, 2022
Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion Paper • 2501.17887 • Published Jan 27, 2025
Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence Paper • 2502.09927 • Published Feb 14, 2025
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published 10 days ago