TON-3B-AITZ / README.md
kolerk's picture
Create README.md
afad7c9 verified
metadata
license: apache-2.0
datasets:
  - kolerk/TON-AITZ-SFT
language:
  - en
base_model:
  - Qwen/Qwen2.5-VL-3B-Instruct
pipeline_tag: image-text-to-text

This is the model cited in the paper: Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models.