Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 12
Inference Providers
Fireworks
Cerebras
Novita
Together AI
Nebius AI
Groq
Hyperbolic
Nscale
+ 6
Apply filters
Models
5,312
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
Salesforce/blip2-opt-2.7b
Image-Text-to-Text
•
4B
•
Updated
Feb 3
•
728k
•
406
Salesforce/instructblip-vicuna-7b
Image-Text-to-Text
•
8B
•
Updated
Feb 3
•
42.4k
•
95
liuhaotian/llava-v1.5-13b
Image-Text-to-Text
•
Updated
May 9, 2024
•
81.8k
•
509
liuhaotian/llava-v1.6-vicuna-7b
Image-Text-to-Text
•
7B
•
Updated
May 9, 2024
•
21k
•
131
deepseek-ai/deepseek-vl-7b-chat
Image-Text-to-Text
•
7B
•
Updated
Mar 15, 2024
•
56.8k
•
261
OpenGVLab/InternVL-Chat-V1-5
Image-Text-to-Text
•
26B
•
Updated
Mar 25
•
2.48k
•
413
HuggingFaceM4/idefics2-8b-chatty
Image-Text-to-Text
•
8B
•
Updated
Jul 30, 2024
•
551
•
95
google/paligemma-3b-pt-224
Image-Text-to-Text
•
3B
•
Updated
Sep 21, 2024
•
42.5k
•
342
abhi-8/Age-gender-predictor
Image-Text-to-Text
•
Updated
May 23, 2024
•
2
openvla/openvla-7b
Image-Text-to-Text
•
8B
•
Updated
Sep 16, 2024
•
122k
•
135
microsoft/Florence-2-base
Image-Text-to-Text
•
0.2B
•
Updated
11 days ago
•
495k
•
289
microsoft/Florence-2-large-ft
Image-Text-to-Text
•
0.8B
•
Updated
11 days ago
•
41.3k
•
362
microsoft/Florence-2-base-ft
Image-Text-to-Text
•
0.2B
•
Updated
11 days ago
•
24.8k
•
126
OpenGVLab/InternVL2-4B
Image-Text-to-Text
•
4B
•
Updated
Mar 25
•
118k
•
53
OpenGVLab/InternVL2-1B
Image-Text-to-Text
•
0.9B
•
Updated
Mar 25
•
146k
•
77
llava-hf/llama3-llava-next-8b-hf
Image-Text-to-Text
•
8B
•
Updated
Jan 27
•
104k
•
44
llava-hf/llava-onevision-qwen2-0.5b-ov-hf
Image-Text-to-Text
•
0.9B
•
Updated
Jun 18
•
191k
•
41
llava-hf/llava-onevision-qwen2-7b-ov-hf
Image-Text-to-Text
•
8B
•
Updated
Jun 18
•
51k
•
33
microsoft/Phi-3.5-vision-instruct
Image-Text-to-Text
•
4B
•
Updated
Sep 26, 2024
•
586k
•
702
TheFinAI/FinLLaVA
Image-Text-to-Text
•
8B
•
Updated
Aug 28, 2024
•
229
•
16
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
8B
•
Updated
Feb 6
•
546k
•
•
1.22k
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int4
Image-Text-to-Text
•
1B
•
Updated
Sep 21, 2024
•
3.43k
•
25
openvla/openvla-7b-finetuned-libero-10
Image-Text-to-Text
•
8B
•
Updated
Oct 9, 2024
•
2.91k
•
4
Qwen/Qwen2-VL-2B
Image-Text-to-Text
•
2B
•
Updated
Dec 6, 2024
•
127k
•
51
Qwen/Qwen2-VL-7B
Image-Text-to-Text
•
8B
•
Updated
Jan 12
•
3.99k
•
57
allenai/MolmoE-1B-0924
Image-Text-to-Text
•
Updated
Apr 24
•
2.79k
•
151
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
8B
•
Updated
Apr 4
•
19.4k
•
541
allenai/Molmo-72B-0924
Image-Text-to-Text
•
73B
•
Updated
Jun 19
•
2.65k
•
288
mPLUG/DocOwl2
Image-Text-to-Text
•
9B
•
Updated
Sep 27, 2024
•
630
•
111
nvidia/NVLM-D-72B
Image-Text-to-Text
•
79B
•
Updated
Jan 14
•
50.7k
•
772
Previous
1
...
3
4
5
6
7
...
100
Next