Hugging Face
Models: 905
Sort: Trending
Active filters: multimodal
HuggingFaceM4/idefics-80b-instruct • Text Generation • Updated Oct 12, 2023 • 1.33k • 188
sshh12/Mistral-7B-LoRA-Multi-VisionCLIPPool-LLAVA • Image-Text-to-Text • Updated Mar 27, 2024 • 2 • 3
remyxai/SpaceLLaVA • Image-Text-to-Text • Updated 24 days ago • 666 • 23
nielsr/imagebind-huge • Updated Apr 28, 2024 • 556 • 17
qnguyen3/nanoLLaVA • Text Generation • Updated Oct 27, 2024 • 3.36k • 154
HuggingFaceM4/idefics2-8b • Image-Text-to-Text • Updated Oct 14, 2024 • 18.6k • 602
Lewdiculous/Aura_v2_7B-GGUF-IQ-Imatrix • Updated Apr 16, 2024 • 236 • 13
chenjoya/videollm-online-8b-v1plus • Video-Text-to-Text • Updated Jul 13, 2024 • 2.21k • 24
lmms-lab/llava-onevision-qwen2-7b-ov • Text Generation • Updated Sep 2, 2024 • 93.6k • 50
openvla/openvla-7b-prismatic • Image-Text-to-Text • Updated Jul 9, 2024 • 133 • 5
lmms-lab/llava-onevision-qwen2-0.5b-ov • Text Generation • Updated Sep 2, 2024 • 34.3k • 18
HuggingFaceM4/Idefics3-8B-Llama3 • Image-Text-to-Text • Updated Dec 2, 2024 • 47.1k • 278
robotics-diffusion-transformer/rdt-1b • Robotics • Updated Oct 17, 2024 • 1.75k • 81
lmms-lab/LLaVA-Video-72B-Qwen2 • Text Generation • Updated Oct 25, 2024 • 913 • 19
lmms-lab/LLaVA-Video-7B-Qwen2 • Video-Text-to-Text • Updated Oct 25, 2024 • 39.3k • 91
Qwen/Qwen2-VL-7B • Image-Text-to-Text • Updated Jan 12 • 40.9k • 52
allenai/Molmo-7B-D-0924 • Image-Text-to-Text • Updated Apr 4 • 31.3k • 525
allenai/Molmo-7B-O-0924 • Image-Text-to-Text • Updated Nov 15, 2024 • 3.13k • 158
allenai/Molmo-72B-0924 • Image-Text-to-Text • Updated Oct 10, 2024 • 1.34k • 284
unsloth/Llama-3.2-11B-Vision • Image-Text-to-Text • Updated Nov 22, 2024 • 571 • 32
unsloth/Llama-3.2-11B-Vision-Instruct • Image-Text-to-Text • Updated Dec 10, 2024 • 38.5k • 79
rhymes-ai/Aria • Image-Text-to-Text • Updated 21 days ago • 15.4k • 628
nvidia/NVLM-D-72B • Image-Text-to-Text • Updated Jan 14 • 15.3k • 770
adamo1139/Qwen2-VL-7B-Sydney • Image-Text-to-Text • Updated Feb 1 • 12 • 5
Rewatiramans/Dermatech-Qwen2-VL-2B • Text Generation • Updated Oct 15, 2024 • 134 • 3
rhymes-ai/Aria-sequential_mlp • Image-Text-to-Text • Updated Nov 22, 2024 • 45 • 17
sahilnishad/Florence-2-FT-DocVQA • Question Answering • Updated Nov 7, 2024 • 4.85k • 1
NexaAIDev/OmniVLM-968M • Updated Dec 17, 2024 • 2.34k • 517
unsloth/llava-v1.6-mistral-7b-hf-bnb-4bit • Image-Text-to-Text • Updated Feb 13 • 2.94k • 6
NCSOFT/VARCO-VISION-14B-HF • Image-Text-to-Text • Updated Mar 17 • 714 • 24