video/image - a dbest111 Collection

Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

dbest111 's Collections

video/image

updated Jul 24, 2025

google/vit-base-patch16-224

Image Classification • 86.6M • Updated Sep 5, 2023 • 4.61M • • 988
OpenGVLab/internimage_g_jointto22k_384

Image Classification • 3B • Updated Mar 25, 2025 • 12 • 1
chancharikm/qwen2.5-vl-72b-cam-motion

Video-Text-to-Text • 73B • Updated Sep 19, 2025 • 107 • 1
lmms-lab/Aero-1-Audio

Text Generation • 2B • Updated Jun 7, 2025 • 177 • 91
mipal/AVATAR

Updated Nov 3, 2025 • 3.17k • 1
FAVOR-Bench/FAVOR

Viewer • Updated May 11 • 27.1k • 570 • 3
lmms-lab/VideoMMMU

Viewer • Updated May 5, 2025 • 900 • 2.48k • 15
moonshotai/Kimi-VL-A3B-Thinking-2506

Image-Text-to-Text • 16B • Updated Jan 30 • 7.47k • 371
lmms-lab/llava-critic-113k

Viewer • Updated Oct 5, 2024 • 113k • 540 • 28
lmms-lab/M4-Instruct-Data

Updated Jul 21, 2024 • 923 • 79
lmms-lab/llava-next-interleave-qwen-7b

Text Generation • 8B • Updated Jul 24, 2024 • 101 • 27
lmms-lab/LLaVA-OneVision-Data

Viewer • Updated May 24, 2025 • 3.94M • 25.2k • 238
avalab/syndicom

Viewer • Updated May 10, 2024 • 19.2k • 27
avalab/iTBLS

Viewer • Updated Jan 17, 2025 • 12.5k • 6
Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification

Paper • 2312.14378 • Published Dec 22, 2023
avalab/cTBLS_knowledge_retriever

Updated Jan 12, 2024
avalab/cTBLS_encoder

Updated Apr 27, 2023
CraftJarvis/minecraft-vla-sft

Viewer • Updated Mar 21, 2025 • 3.78M • 1.35k • 10

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs