Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
merve
's Collections
Sep 1 Releases
August 29 Releases
Aug 22 Releases
Releases August 9
Releases August 2
Releases July 25
Releases July 18
Releases July 11
Releases July 4
Releases June 27
June 20 Releases
OCR Models & Datasets
Releases June 13
Releases June 6
Releases 30 May
Releases 23 May
May 16 Releases
May 9 Releases
Any-to-Any Models, Datasets, Spaces
Releases Apr 21 & May 2
InternVL3 HF
April 16 Releases
Multimodal DSE Retrievers
April 11 Releases
March 28 Releases
March 21 Releases
TΓΌrkΓ§e VLMler
Feb 14 Releases π
Feb 7 Releases π§£
January 31 Releases π§€
Models, Jan 27
Jan 24 Releases
Jan 17 Releases βοΈ
Jan 10 Releases π¨οΈ
Dec 6 Releases π
Nov 29 Releases π²π²
Nov 22 Releases βοΈ
Nov 15 Releases π
Nov 1 Releases
MIT Talk 31/10 Papers
October 25 Releases
LOTUS πͺ·
New Depth Models
BRAVE Models π¦
Computer Vision Backbones π§©
Image Classification Models πΆ π±
Object Detection Models π₯₯
Image Segmentation Models π
Zero-shot Image Classification Models πΌοΈ
Image-to-Image Models π¨
Video Classification Models πΊ
Image-to-Text Models π
Text-to-Image Models π₯
Foundation Models for Vision π§©
Segment Anything Model
OWL-series π¦
SigLIP
Awesome Document AI
SegGPT
Vision Language Models Papers πΌοΈπ¬π
gvhf/owl
gv-hf/owl
merve/owl2
Depth Anything v2 Release
Document VLM Papers
Vision Language Leaderboards
Video Language Models
SAM2
NVEagle
Multimodal RAG
Zero-shot Segmentation
Feb 14 Releases π
updated
Feb 14
Upvote
7
OpenGVLab/InternVideo2_5_Chat_8B
Video-Text-to-Text
β’
8B
β’
Updated
Aug 4
β’
9.38k
β’
82
AIDC-AI/Ovis2-34B
Image-Text-to-Text
β’
35B
β’
Updated
24 days ago
β’
21.8k
β’
151
open-r1/OpenR1-Qwen-7B
Text Generation
β’
8B
β’
Updated
May 28
β’
1.17k
β’
β’
54
nomic-ai/nomic-embed-text-v2-moe
Sentence Similarity
β’
0.5B
β’
Updated
Apr 1
β’
289k
β’
429
Zyphra/Zonos-v0.1-hybrid
Text-to-Speech
β’
Updated
Jun 3
β’
25k
β’
1.1k
agentica-org/DeepScaleR-1.5B-Preview
Text Generation
β’
2B
β’
Updated
Apr 9
β’
29.9k
β’
571
open-r1/OpenR1-Math-Raw
Viewer
β’
Updated
Feb 24
β’
516k
β’
381
β’
74
open-r1/OpenR1-Math-220k
Viewer
β’
Updated
Feb 18
β’
450k
β’
16.3k
β’
645
Zyphra/Zonos-v0.1-transformer
Text-to-Speech
β’
Updated
Jun 3
β’
31.5k
β’
412
AIDC-AI/Ovis2-1B
Image-Text-to-Text
β’
1B
β’
Updated
24 days ago
β’
61.3k
β’
92
AIDC-AI/Ovis2-16B
Image-Text-to-Text
β’
16B
β’
Updated
24 days ago
β’
81.4k
β’
100
AIDC-AI/Ovis2-2B
Image-Text-to-Text
β’
2B
β’
Updated
24 days ago
β’
26.6k
β’
59
AIDC-AI/Ovis2-8B
Image-Text-to-Text
β’
9B
β’
Updated
24 days ago
β’
91.7k
β’
73
AIDC-AI/Ovis2-4B
Image-Text-to-Text
β’
5B
β’
Updated
24 days ago
β’
43.7k
β’
61
sbintuitions/modernbert-ja-130m
Fill-Mask
β’
0.1B
β’
Updated
May 1
β’
5.8k
β’
β’
45
Zyphra/Zonos-v0.1-speaker-embedding
Updated
Feb 12
β’
28
GAIR/LIMO
33B
β’
Updated
Feb 6
β’
1.14k
β’
43
prithivMLmods/Hoags-2B-Exp
Image-Text-to-Text
β’
2B
β’
Updated
Feb 15
β’
7
β’
3
Metric-AI/ColQwenStella-2b-multilingual
Visual Document Retrieval
β’
Updated
Mar 25
β’
4
β’
9
apple/DepthPro-hf
Depth Estimation
β’
1.0B
β’
Updated
Feb 28
β’
17.3k
β’
64
Liberata/illustrious-xl-v1.0
Text-to-Image
β’
Updated
Feb 12
β’
144
OpenGVLab/InternVL_2_5_HiCo_R16
Video-Text-to-Text
β’
8B
β’
Updated
Feb 13
β’
2.19k
β’
5
OpenGVLab/InternVL_2_5_HiCo_R64
Video-Text-to-Text
β’
8B
β’
Updated
May 13
β’
94
β’
3
Upvote
7
+3
Share collection
View history
Collection guide
Browse collections