Edit Models filters

Apps

Apps with no match

Inference Providers

HF Inference API

Inference Providers with no match

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

Mixture of Experts

Misc with no match

text-embeddings-inference

Carbon Emissions

Models

1,242

Full-text search

Active filters: multimodal

csfufu/Revisual-R1-final

Image-Text-to-Text • 8B • Updated 23 days ago • 640 • 5

unsloth/Qwen2.5-VL-7B-Instruct-GGUF

Image-Text-to-Text • 8B • Updated May 12 • 9.5k • 9

unsloth/Qwen2.5-VL-32B-Instruct-GGUF

Image-Text-to-Text • 33B • Updated May 12 • 4.67k • 4

zhaode/FastVLM-0.5B-Stage2

Image-Text-to-Text • 0.8B • Updated May 20 • 103 • 1

stockmark/Stockmark-2-VL-100B-beta

Image-Text-to-Text • 96B • Updated 25 days ago • 1.95k • 18

imageomics/bioclip-2

Zero-Shot Image Classification • Updated 22 days ago • 5.08k • 10

lingshu-medical-mllm/Lingshu-32B

Image-Text-to-Text • 33B • Updated 3 days ago • 1.84k • 38

Sungyeon/GENIUS

Visual Document Retrieval • Updated 22 days ago • 1

humbleakh/qwen2.5-vl-3b-8bit-chain-of-zoom

Image-to-Text • Updated 20 days ago • 72 • 1

mehmetkuzucu/Waffle-v1.0

Visual Question Answering • 0.2B • Updated 18 days ago • 98 • 4

mradermacher/SpaceOm-GGUF

3B • Updated 7 days ago • 325 • 2

mradermacher/SpaceOm-i1-GGUF

3B • Updated 7 days ago • 630 • 2

rinabuoy/nanoVLM

Image-Text-to-Text • 0.2B • Updated 9 days ago • 30 • 2

adriabama06/UI-TARS-1.5-7B-Q4_K_M-GGUF

Image-Text-to-Text • 8B • Updated 7 days ago • 22 • 1

adriabama06/UI-TARS-1.5-7B-GGUF

Image-Text-to-Text • 8B • Updated 7 days ago • 134 • 1

avin-255/nanoVLM

Image-Text-to-Text • 0.2B • Updated 7 days ago • 18 • 1

thesby/Qwen2.5-VL-7B-NSFW-Caption-V3

Image-Text-to-Text • 8B • Updated 11 days ago • 168 • 7

sujitpal/clip-imageclef

Zero-Shot Image Classification • Updated Oct 31, 2023 • 60 • 3

waybarrios/guidance-based-video-grounding

Updated Apr 1, 2023

MonoHime/mosei-senti-intermodal

Feature Extraction • Updated May 18, 2023 • 52

MonoHime/mosei-emo-intermodal

Feature Extraction • Updated May 18, 2023 • 39

MonoHime/iemocap-emo-intermodal

Feature Extraction • Updated May 18, 2023 • 23

MonoHime/mosi-senti-intermodal

Feature Extraction • Updated May 18, 2023 • 44

MonoHime/meld-emo-intermodal

Feature Extraction • Updated May 18, 2023 • 17

imageomics/bioclip

Zero-Shot Image Classification • Updated May 17, 2024 • 33k • 49

HuggingFaceM4/idefics-80b

Text Generation • 80B • Updated Oct 12, 2023 • 43 • 70

HuggingFaceM4/idefics-9b

Text Generation • 9B • Updated Oct 12, 2023 • 1.4k • 46

HuggingFaceM4/idefics-80b-instruct

Text Generation • 80B • Updated Oct 12, 2023 • 1.93k • 189

typeof/idefics-9b

Text Generation • Updated Oct 13, 2023 • 24

sshh12/Mistral-7B-LoRA-VisionCLIP-LLAVA

Text Generation • Updated Oct 28, 2023 • 79 • 10