Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 12
Inference Providers
Fireworks
Cerebras
Novita
Together AI
Nebius AI
Groq
Hyperbolic
Nscale
+ 6
Apply filters
Models
5,312
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
stepfun-ai/step3
Image-Text-to-Text
•
321B
•
Updated
13 days ago
•
994
•
143
MizzenAI/HPSv3
Image-Text-to-Text
•
Updated
2 days ago
•
13
•
18
QuantTrio/GLM-4.5V-AWQ
Image-Text-to-Text
•
17B
•
Updated
3 days ago
•
468
•
10
Qwen/Qwen2.5-VL-3B-Instruct
Image-Text-to-Text
•
4B
•
Updated
Apr 6
•
3.73M
•
486
ByteDance-Seed/UI-TARS-1.5-7B
Image-Text-to-Text
•
8B
•
Updated
Apr 18
•
107k
•
348
inclusionAI/ViLaSR
Image-Text-to-Text
•
8B
•
Updated
4 days ago
•
17.8k
•
16
moonshotai/Kimi-VL-A3B-Thinking-2506
Image-Text-to-Text
•
16B
•
Updated
15 days ago
•
36.8k
•
263
xlangai/OpenCUA-32B
Image-Text-to-Text
•
33B
•
Updated
about 14 hours ago
•
37
•
9
microsoft/Florence-2-large
Image-Text-to-Text
•
0.8B
•
Updated
11 days ago
•
1.13M
•
1.63k
Qwen/Qwen2.5-VL-32B-Instruct
Image-Text-to-Text
•
33B
•
Updated
Apr 14
•
512k
•
•
424
unsloth/Qwen2.5-VL-7B-Instruct-GGUF
Image-Text-to-Text
•
8B
•
Updated
May 12
•
38.4k
•
30
google/medgemma-27b-it
Image-Text-to-Text
•
29B
•
Updated
Jul 10
•
21.4k
•
173
inference-net/ClipTagger-12b
Image-Text-to-Text
•
12B
•
Updated
1 day ago
•
52
•
7
HuggingFaceTB/SmolVLM2-2.2B-Instruct
Image-Text-to-Text
•
2B
•
Updated
Apr 8
•
131k
•
240
moonshotai/Kimi-VL-A3B-Instruct
Image-Text-to-Text
•
16B
•
Updated
16 days ago
•
193k
•
232
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
Image-Text-to-Text
•
Updated
Jun 13
•
545k
•
143
InfiX-ai/InfiGUI-G1-3B
Image-Text-to-Text
•
4B
•
Updated
3 days ago
•
395
•
6
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
•
11B
•
Updated
Sep 27, 2024
•
32.4k
•
542
deepseek-ai/deepseek-vl2
Image-Text-to-Text
•
27B
•
Updated
Dec 18, 2024
•
3.7k
•
356
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text
•
73B
•
Updated
Jun 6
•
767k
•
•
525
HuggingFaceTB/SmolVLM2-500M-Video-Instruct
Image-Text-to-Text
•
0.5B
•
Updated
Apr 8
•
69.2k
•
88
unsloth/gemma-3-4b-it-GGUF
Image-Text-to-Text
•
4B
•
Updated
about 12 hours ago
•
35.3k
•
123
google/gemma-3-12b-it-qat-q4_0-gguf
Image-Text-to-Text
•
12B
•
Updated
Apr 11
•
93.5k
•
170
google/gemma-3n-E4B
Image-Text-to-Text
•
8B
•
Updated
Jul 14
•
17k
•
84
nvidia/VideoITG-8B
Image-Text-to-Text
•
8B
•
Updated
2 days ago
•
9
•
5
InfiX-ai/InfiGUI-G1-7B
Image-Text-to-Text
•
8B
•
Updated
3 days ago
•
163
•
5
Qwen/QVQ-72B-Preview
Image-Text-to-Text
•
73B
•
Updated
Jan 12
•
44.5k
•
603
huihui-ai/Qwen2.5-VL-7B-Instruct-abliterated
Image-Text-to-Text
•
8B
•
Updated
Apr 1
•
1.8k
•
23
nvidia/Eagle2.5-8B
Image-Text-to-Text
•
8B
•
Updated
6 days ago
•
12.6k
•
24
microsoft/GUI-Actor-2B-Qwen2-VL
Image-Text-to-Text
•
2B
•
Updated
6 days ago
•
266
•
18
Previous
1
2
3
4
...
100
Next