Update README.md
README.md
CHANGED
@@ -160,11 +160,18 @@ model-index:
 
 ---
 
-# LLaVA-Next-Inst-It-Vicuna-7B
-introduced in the paper [Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning](https://huggingface.co/papers/2412.03565)
-
+# LLaVA-Next-Inst-It-Vicuna-7B
 [**Homepage**](https://inst-it.github.io/) | [**Code**](https://github.com/inst-it/inst-it) | [**Paper**](https://huggingface.co/papers/2412.03565) | [**arXiv**](https://arxiv.org/abs/2412.03565)
 
+LLaVA-Next-Inst-It-Vicuna-7B is a multimodal model that excels at instance-level understanding,
+which is introduced in the paper [Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning](https://huggingface.co/papers/2412.03565)
+
+* **Architecture**: clip-vit-large-patch14-336 + Vicuna-7B
+* **Initialized Model**: LLaVA-NeXT
+* **Data**: LLaVA-NeXT-Data / Inst-IT-Dataset
+* **Precision**: bfloat16
+
+
 ## Quick Start
 **Install**
 