license: apache-2.0 | |
base_model: Qwen/Qwen2.5-VL-7B-Instruct | |
tags: | |
- vision | |
- multimodal | |
- qwen2.5-vl | |
- fine-tuned | |
language: | |
- en | |
pipeline_tag: image-text-to-text | |
# Fine-tuned Qwen2.5-VL Model | |
This is a fine-tuned version of Qwen/Qwen2.5-VL-7B-Instruct trained using Axolotl. | |
## Model Details | |
- **Base Model**: Qwen/Qwen2.5-VL-7B-Instruct | |
- **Training Framework**: Axolotl | |
- **Training Type**: LoRA Fine-tuning (language model only) | |
## Training Configuration | |
- Learning Rate: 0.0002 | |
- Optimizer: adamw_8bit | |
- Scheduler: cosine | |
- Precision: bf16 | |
- Checkpoints: Disabled for efficiency | |