Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
lmms-lab
's Collections
VideoMMMU
Multimodal-SAE
LLaVA-Critic
LLaVA-Video
LLaVA-OneVision
LMMs-Eval
LongVA
LLaVA-Next-Interleave
LLaVA-NeXT
LMMs-Eval-Lite
LLaVA-OneVision
updated
Oct 5, 2024
a model good at arbitrary types of visual input
Upvote
22
+12
LLaVA-OneVision: Easy Visual Task Transfer
Paper
•
2408.03326
•
Published
Aug 6, 2024
•
60
lmms-lab/LLaVA-OneVision-Mid-Data
Viewer
•
Updated
Aug 26, 2024
•
563k
•
354
•
16
lmms-lab/LLaVA-OneVision-Data
Viewer
•
Updated
Oct 22, 2024
•
3.72M
•
9.44k
•
160
lmms-lab/LLaVA-NeXT-Data
Viewer
•
Updated
Aug 30, 2024
•
779k
•
2.36k
•
28
lmms-lab/llavanext-qwen-siglip-tokenizer
Text Generation
•
Updated
Jul 11, 2024
•
277
•
3
lmms-lab/llava-onevision-qwen2-0.5b-si
Text Generation
•
Updated
Sep 2, 2024
•
9.28k
•
13
lmms-lab/llava-onevision-qwen2-0.5b-ov
Text Generation
•
Updated
Sep 2, 2024
•
57.9k
•
15
lmms-lab/llava-onevision-qwen2-7b-si
Text Generation
•
Updated
Sep 2, 2024
•
13.9k
•
12
lmms-lab/llava-onevision-qwen2-7b-ov
Text Generation
•
Updated
Sep 2, 2024
•
224k
•
44
lmms-lab/llava-onevision-qwen2-72b-si
Text Generation
•
Updated
Sep 2, 2024
•
440
•
1
lmms-lab/llava-onevision-qwen2-72b-ov-sft
Text Generation
•
Updated
Sep 2, 2024
•
2.84k
•
14
lmms-lab/llava-onevision-qwen2-72b-ov-chat
Image-Text-to-Text
•
Updated
Oct 9, 2024
•
584
•
8
lmms-lab/llava-onevision-projectors
Updated
Aug 14, 2024
•
3
lmms-lab/llava-onevision-qwen2-0.5b-mid-stage-a4
Updated
Aug 6, 2024
•
133
lmms-lab/llava-onevision-qwen2-7b-mid-stage-a4
Updated
Aug 6, 2024
•
3.21k
Upvote
22
+18
Share collection
View history
Collection guide
Browse collections