Zhang199's picture
Update README.md
241f3ad verified
metadata
license: apache-2.0
pipeline_tag: video-text-to-text
library_name: transformers

TinyLLaVA-Video-R1

arXivGithub

This model is obtained by cold-starting TinyLLaVA-Video with 16 manually annotated samples from the NextQA dataset. It serves as the base model for TinyLLaVA-Video-R1.

The 16 manually annotated samples used for cold-starting have been released here.