-
Emu3: Next-Token Prediction is All You Need
Paper • 2409.18869 • Published • 94 -
Harnessing Webpage UIs for Text-Rich Visual Understanding
Paper • 2410.13824 • Published • 31 -
PaliGemma: A versatile 3B VLM for transfer
Paper • 2407.07726 • Published • 70 -
YaRN: Efficient Context Window Extension of Large Language Models
Paper • 2309.00071 • Published • 68
aman prakash
MLap
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 20 hours ago
MLap/Book-Scan-OCR
liked
a Space
3 days ago
facebook/seamless-streaming
updated
a model
6 days ago
MLap/Llama-3.2-Vision-OCR
Organizations
None yet