| license: apache-2.0 | |
| base_model: Qwen/Qwen2.5-VL-7B-Instruct | |
| tags: | |
| - vision | |
| - multimodal | |
| - qwen2.5-vl | |
| - fine-tuned | |
| language: | |
| - en | |
| pipeline_tag: image-text-to-text | |
| # Fine-tuned Qwen2.5-VL Model | |
| This is a fine-tuned version of Qwen/Qwen2.5-VL-7B-Instruct trained using Axolotl. | |
| ## Model Details | |
| - **Base Model**: Qwen/Qwen2.5-VL-7B-Instruct | |
| - **Training Framework**: Axolotl | |
| - **Training Type**: LoRA Fine-tuning (language model only) | |
| ## Training Configuration | |
| - Learning Rate: 0.0002 | |
| - Optimizer: adamw_8bit | |
| - Scheduler: cosine | |
| - Precision: bf16 | |
| - Checkpoints: Disabled for efficiency | |