Text Generation
Transformers
PyTorch
geochat
Commit cca91eb (verified) by kartik060702 · 1 Parent(s): 9194958

Update README.md

Files changed (1)
  1. README.md +33 -0
README.md CHANGED
@@ -1,3 +1,36 @@
  ---
  license: apache-2.0
  ---
+
+ # GeoChat-7B
+
+ GeoChat is the first grounded large vision-language model specifically tailored to remote sensing (RS) scenarios. Unlike general-domain models, GeoChat excels at handling high-resolution RS imagery and uses region-level reasoning for comprehensive scene interpretation. It is built on the LLaVA-1.5 architecture and fine-tuned on a newly created RS multimodal dataset. The result is robust zero-shot performance across a range of RS tasks, including image and region captioning, visual question answering, scene classification, visually grounded conversations, and referring object detection.
+
+ <!-- Provide a longer summary of what this model is. -->
+ - **Developed by:** MBZUAI
+
+ ### Model Sources
+
+ <!-- Provide the basic links for the model. -->
+
+ - **Repository:** https://github.com/mbzuai-oryx/GeoChat
+ - **Paper:** https://arxiv.org/abs/2311.15826
+
+ **BibTeX:**
+
+ ```bibtex
+ @misc{kuckreja2023geochat,
+   title={GeoChat: Grounded Large Vision-Language Model for Remote Sensing},
+   author={Kartik Kuckreja and Muhammad Sohail Danish and Muzammal Naseer and Abhijit Das and Salman Khan and Fahad Shahbaz Khan},
+   year={2023},
+   eprint={2311.15826},
+   archivePrefix={arXiv},
+   primaryClass={cs.CV}
+ }
+ ```
+ ## Authors
+ Kartik Kuckreja, Muhammad Sohail Danish
+
+ ## Contact
+
+