11 46 179

dinhanhx

dinhanhx

AI & ML interests

Vision Language

Recent Activity

upvoted an article 1 day ago

Large-scale Near-deduplication Behind BigCode

liked a model 1 day ago

ChatDOC/OCRFlux-3B

upvoted an article 8 days ago

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

View all activity

Organizations

upvoted an article 1 day ago

Article

Large-scale Near-deduplication Behind BigCode

•

May 16, 2023

• 31

liked a model 1 day ago

ChatDOC/OCRFlux-3B

Image-Text-to-Text • 4B • Updated 4 days ago • 4.09k • 198

upvoted an article 8 days ago

Article

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

and 10 others •

9 days ago

• 23

liked 2 models 11 days ago

nanonets/Nanonets-OCR-s

Image-Text-to-Text • 4B • Updated 16 days ago • 266k • 1.34k

reducto/RolmOCR

Image-Text-to-Text • 8B • Updated Apr 2 • 118k • 445

liked a model 27 days ago

wybxc/DocLayout-YOLO-DocStructBench-onnx

Updated Jan 8 • 7

liked a model about 1 month ago

google/gemma-3-4b-it

Image-Text-to-Text • 4B • Updated Mar 21 • 1.34M • • 689

liked a Space about 1 month ago

YOLOv11 Document Layout Analysis

🏃

inference example of trained YOLOv11-x on DocLayNet dataset.

liked 5 models about 2 months ago

microsoft/layoutlmv3-base

0.1B • Updated Apr 10, 2024 • 1.38M • 418

Snowflake/snowflake-arctic-embed-l-v2.0

GreenNode/GreenNode-Embedding-Large-VN-V1

tiiuae/Falcon3-10B-Base-1.58bit

Text Generation • 3B • Updated Dec 20, 2024 • 46 • 7

unsloth/Qwen2.5-VL-7B-Instruct-unsloth-bnb-4bit

Image-Text-to-Text • 5B • Updated May 12 • 38.2k • 34

upvoted an article about 2 months ago

Article

SigLIP 2: A better multilingual vision language encoder

and 2 others •

Feb 21

• 172

upvoted a collection 2 months ago

DocAI

Collection

20 items • Updated Apr 20 • 1

liked a model 2 months ago

nvidia/Llama-3.1-Nemotron-Nano-8B-v1

Text Generation • 8B • Updated May 8 • 420k • • 189

upvoted an article 2 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

and 2 others •

Apr 15, 2024

• 182

liked a Space 2 months ago

102

Idefics3

📊

Generate text based on an image and prompt

upvoted an article 2 months ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

•

Mar 26

• 143

liked a model 2 months ago

google/siglip2-giant-opt-patch16-384

Zero-Shot Image Classification • 2B • Updated Feb 21 • 69.1k • 17

dinhanhx

AI & ML interests

Recent Activity

Organizations

dinhanhx's activity

Large-scale Near-deduplication Behind BigCode

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

YOLOv11 Document Layout Analysis

SigLIP 2: A better multilingual vision language encoder

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Idefics3

Training and Finetuning Reranker Models with Sentence Transformers v4