small but mighty 🔥 you can fine-tune SmolVLM on an L4 with a batch size of 4, and it only takes 16.4 GB of VRAM 🫰🏻 with gradient accumulation, the effective batch size is 16 ✨ I made a notebook that includes all the goodies: QLoRA, gradient accumulation, and gradient checkpointing, with explanations of how they work 👇 https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb
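For a feel of how these pieces fit together, here is a minimal sketch of that kind of setup using transformers + peft + bitsandbytes: QLoRA (4-bit base model with LoRA adapters), gradient checkpointing, and gradient accumulation. The model ID, LoRA target modules, and hyperparameters here are illustrative assumptions, not necessarily the exact values in the notebook:

```python
import torch
from transformers import (
    AutoModelForVision2Seq,
    BitsAndBytesConfig,
    TrainingArguments,
)
from peft import LoraConfig, get_peft_model

model_id = "HuggingFaceTB/SmolVLM-Instruct"  # assumed checkpoint name

# QLoRA part 1: load the frozen base model in 4-bit NF4 so only the
# small LoRA adapters are trained in higher precision
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForVision2Seq.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",
)

# QLoRA part 2: attach LoRA adapters (target modules are an assumption)
lora_config = LoraConfig(
    r=8,
    lora_alpha=8,
    lora_dropout=0.1,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
model = get_peft_model(model, lora_config)

# Gradient checkpointing: drop intermediate activations in the forward
# pass and recompute them during backward, trading compute for VRAM
model.gradient_checkpointing_enable()

# Gradient accumulation: batch size 4 x 4 accumulation steps
# = effective batch size 16 without the memory cost of a real batch of 16
training_args = TrainingArguments(
    output_dir="./smolvlm-ft",
    per_device_train_batch_size=4,
    gradient_accumulation_steps=4,
    bf16=True,
    learning_rate=1e-4,
    num_train_epochs=1,
)
```

The nice property of this combo is that each trick attacks a different memory consumer: 4-bit quantization shrinks the weights, LoRA shrinks the optimizer states, checkpointing shrinks activations, and accumulation lets you keep the per-step batch small while still training with a larger effective batch.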