LLaVA-OV-Manager / README.md
LooperXX's picture
Add pipeline tag, transformers library name, and link to Hugging Face paper (#1)
c0e3e5e verified
metadata
base_model:
  - Qwen/Qwen2-0.5B-Instruct
  - google/siglip-so400m-patch14-384
datasets:
  - liuhaotian/LLaVA-Pretrain
  - lmms-lab/LLaVA-ReCap-558K
  - lmms-lab/LLaVA-ReCap-118K
  - lmms-lab/LLaVA-ReCap-CC3M
  - lmms-lab/LLaVA-OneVision-Mid-Data
  - lmms-lab/LLaVA-OneVision-Data
  - Zhiqiang007/MathV360K
language:
  - en
license: mit
pipeline_tag: image-text-to-text
library_name: transformers
tags:
  - LLaVA-OneVision-Manager
  - LLaVA-OV-Manager
  - Manager

Model weights for our submission to TCSVT, titled "Manager: Aggregating Insights from Unimodal Experts in Two-Tower VLMs and MLLMs".

Related materials can be found at Paper, Code, https://looperxx.github.io/.