Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Lingyu Kong's picture
1 8 6

Lingyu Kong

kppkkp
deepak191z's profile picture PeepDaSlan9's profile picture jizhongpeng's profile picture
·
https://www.kppkkp.top/
  • LingyvKong

AI & ML interests

None yet

Organizations

None yet

authored 2 papers 9 months ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3, 2024 • 84

Focus Anywhere for Fine-grained Multi-page Document Understanding

Paper • 2405.14295 • Published May 23, 2024 • 1
authored 3 papers about 1 year ago

Merlin:Empowering Multimodal LLMs with Foresight Minds

Paper • 2312.00589 • Published Nov 30, 2023 • 27

Vary: Scaling up the Vision Vocabulary for Large Vision-Language Models

Paper • 2312.06109 • Published Dec 11, 2023 • 21

OneChart: Purify the Chart Structural Extraction via One Auxiliary Token

Paper • 2404.09987 • Published Apr 15, 2024 • 2
authored a paper over 1 year ago

Small Language Model Meets with Reinforced Vision Vocabulary

Paper • 2401.12503 • Published Jan 23, 2024 • 33
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs