merve's Collections
Multimodal RAG
Updated Sep 5, 2024
vidore/colpali-v1.2 • Image Feature Extraction • Updated 5 days ago • 100k • 104
Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • Updated 6 days ago • 1.73M • 1.06k
Qwen/Qwen2-VL-2B-Instruct • Image-Text-to-Text • Updated 6 days ago • 1.91M • 373
Qwen/Qwen2-72B-Instruct • Text Generation • Updated Oct 8, 2024 • 38.1k • 694
openbmb/MiniCPM-V-2_6 • Image-Text-to-Text • Updated 3 days ago • 65.3k • 911
Qwen2-VL-72B • Running • 596
ColPali Document Retrieval • Running on Zero • 109
vidore/colpali_train_set • Viewer • Updated Sep 4, 2024 • 119k • 1.18k • 70
lmms-lab/llava-onevision-qwen2-7b-ov • Text Generation • Updated Sep 2, 2024 • 332k • 42
HuggingFaceM4/Idefics3-8B-Llama3 • Image-Text-to-Text • Updated Dec 2, 2024 • 20.1k • 262