MutiModal_Dataset - a L-Hongbin Collection

MutiModal_Dataset

updated Dec 27, 2024

BAAI/Infinity-MM

Updated Dec 13, 2024 • 10.4k • 88
MMInstruction/M3IT

Updated Nov 24, 2023 • 4.58k • 123
WildVision/wildvision-chat

Viewer • Updated Aug 30, 2024 • 45.2k • 54 • 19
Spawning/PD12M

Viewer • Updated 28 days ago • 12.4M • 1.55k • 151
lmms-lab/LLaVA-Video-178K

Viewer • Updated Oct 11, 2024 • 1.63M • 6.37k • 106
neulab/MultiUI

Viewer • Updated Nov 22, 2024 • 7.29M • 2.25k • 41
EDGEwww25/EDGE-Dataset

Viewer • Updated Oct 19, 2024 • 1.66M • 59
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Paper • 2409.04429 • Published Sep 6, 2024
Salesforce/blip3-kale

Viewer • Updated 4 days ago • 235M • 6.94k • 35
Marqo/marqo-GS-10M

Viewer • Updated Oct 23, 2024 • 9.81M • 1.07k • 48
JefferyZhan/Language-prompted-Localization-Dataset

Preview • Updated Jul 11, 2024 • 101 • 3
lyan62/FoodieQA

Viewer • Updated Nov 26, 2024 • 392 • 128 • 8
mlfoundations/MINT-1T-HTML

Viewer • Updated Sep 21, 2024 • 623M • 156k • 81
DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding

Paper • 2411.14347 • Published Nov 21, 2024 • 13
THUDM/CogVLM-SFT-311K

Preview • Updated Dec 26, 2023 • 72 • 49
Inst-IT/Inst-IT-Dataset

Viewer • Updated Dec 19, 2024 • 72.5k • 98 • 7
liboaccn/MIT-10M

Viewer • Updated 17 days ago • 10.9M • 67 • 7
syp115/DCE-1M

Viewer • Updated Dec 20, 2024 • 1.09M • 22 • 2
xchen16/CompCap-gpt4

Viewer • Updated Dec 20, 2024 • 110k • 124 • 2
Salesforce/blip3-grounding-50m

Viewer • Updated 4 days ago • 52.4M • 510 • 20
