metadata
license: apache-2.0
pipeline_tag: video-text-to-text
library_name: transformers
This model is obtained by cold-starting TinyLLaVA-Video with 16 manually annotated samples from the NextQA dataset. It serves as the base model for TinyLLaVA-Video-R1.
The 16 manually annotated samples used for cold-starting have been released here.