BLIP for RSICD image captioning:

blip-image-captioning-base model has been finetuned on the rsicd dataset. Training parameters used are as follows:
- learning_rate = 5e-7
- optimizer = AdamW
- scheduler = ReduceLROnPlateau
- epochs = 5
More details (demo, testing, evaluation, metrics) available at github repo

Downloads last month: 25

Safetensors

Model size

247M params

Tensor type

F32

Inference Providers NEW

Image-to-Text

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Gurveer05
/

blip-image-captioning-base-rscid-finetuned

BLIP for RSICD image captioning:

Dataset used to train Gurveer05/blip-image-captioning-base-rscid-finetuned