Éclair -- Extracting Content and Layout with Integrated Reading Order for Documents Paper • 2502.04223 • Published Feb 6
olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models Paper • 2502.18443 • Published Feb 25
A Token-level Text Image Foundation Model for Document Understanding Paper • 2503.02304 • Published Mar 4