Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
Backyard AI
Jan
Jellybox
llama.cpp
LM Studio
LocalAI
Msty
node-llama-cpp
Ollama
RecurseChat
Sanctum
TGI
vLLM
Apps with no match
Draw Things
DiffusionBee
Invoke
JoyFusion
MLX LM
Inference Providers
Inference Providers with no match
Novita
Fireworks
Nebius AI
Together AI
Cerebras
Featherless AI
Hyperbolic
Nscale
SambaNova
fal
Groq
Replicate
Cohere
HF Inference API
Misc
Reset Misc
vision-language
Inference Endpoints
custom_code
text-generation-inference
Eval Results
4-bit precision
8-bit precision
Carbon Emissions
Misc with no match
Merge
text-embeddings-inference
Mixture of Experts
Apply filters
Models
256
Full-text search
Edit filters
Sort: Trending
Active filters:
vision-language
Clear all
scb10x/typhoon-ocr-7b
Image-Text-to-Text
•
8B
•
Updated
5 days ago
•
17.5k
•
57
remyxai/SpaceOm
Image-Text-to-Text
•
4B
•
Updated
about 15 hours ago
•
716
•
7
scb10x/typhoon-ocr-3b
Image-Text-to-Text
•
4B
•
Updated
5 days ago
•
323
•
3
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
0.7B
•
Updated
Feb 4
•
26.6k
•
1.49k
stepfun-ai/GOT-OCR-2.0-hf
Image-Text-to-Text
•
0.6B
•
Updated
Jan 31
•
36.6k
•
211
lusxvr/nanoVLM-222M
Image-Text-to-Text
•
0.2B
•
Updated
May 8
•
2.21k
•
89
smolagents/Qwen2.5-VL-3B-Instruct-Agentic
Image-Text-to-Text
•
4B
•
Updated
3 days ago
•
4
•
2
remyxai/SpaceQwen2.5-VL-3B-Instruct
Image-Text-to-Text
•
4B
•
Updated
1 day ago
•
90.9k
•
14
mradermacher/SpaceQwen2.5-VL-3B-Instruct-GGUF
Robotics
•
3B
•
Updated
7 days ago
•
244
•
1
mradermacher/SpaceQwen2.5-VL-3B-Instruct-i1-GGUF
Robotics
•
3B
•
Updated
7 days ago
•
271
•
1
mradermacher/SpaceOm-GGUF
3B
•
Updated
8 days ago
•
325
•
2
mradermacher/SpaceOm-i1-GGUF
3B
•
Updated
8 days ago
•
630
•
2
rinabuoy/nanoVLM
Image-Text-to-Text
•
0.2B
•
Updated
10 days ago
•
30
•
2
SaltySander/MOSAIC
Updated
4 days ago
•
1
lil-lab/kilogram-models
Updated
Aug 17, 2024
stabilityai/japanese-stable-vlm
Image-to-Text
•
8B
•
Updated
Jul 10, 2024
•
18
•
49
wayveai/Lingo-Judge
Text Classification
•
Updated
Mar 19, 2024
•
4.9k
•
4
bczhou/tiny-llava-v1-hf
Image-Text-to-Text
•
1B
•
Updated
Aug 17, 2024
•
4.72k
•
57
bczhou/TinyLLaVA-3.1B
Text Generation
•
3B
•
Updated
Mar 25, 2024
•
463
•
27
bczhou/TinyLLaVA-2.0B
Image-Text-to-Text
•
2B
•
Updated
Jul 26, 2024
•
755
•
6
bczhou/TinyLLaVA-1.5B
Image-Text-to-Text
•
2B
•
Updated
Jun 14, 2024
•
263
•
17
SakanaAI/EvoVLM-JP-v1-7B
Image-to-Text
•
8B
•
Updated
Mar 21, 2024
•
23
•
36
HyperGAI/HPT
Updated
May 17, 2024
•
20
•
41
bczhou/TinyLLaVA-3.1B-Pretrain
Text Generation
•
3B
•
Updated
Mar 25, 2024
•
25
HyperGAI/HPT1_5-Air-Llama-3-8B-Instruct-multimodal
Text Generation
•
Updated
May 15, 2024
•
29
•
46
HyperGAI/HPT1_5-Edge
Text Generation
•
Updated
Jun 5, 2024
•
47
•
9
McGill-NLP/AURORA
Image-to-Image
•
Updated
Dec 21, 2024
•
5
•
4
SakanaAI/Llama-3-EvoVLM-JP-v2
Image-to-Text
•
8B
•
Updated
Aug 1, 2024
•
7.65k
•
20
AXCXEPT/Llama-3-EZO-VLM-1
Image-to-Text
•
8B
•
Updated
Aug 23, 2024
•
21
•
7
mallapraveen/GOT-OCR2_0
Image-Text-to-Text
•
0.7B
•
Updated
Sep 15, 2024
•
20
Previous
1
2
3
...
9
Next