TinyLLaVA-Video-R1

This model is obtained by cold-starting TinyLLaVA-Video with 16 manually annotated samples from the NextQA dataset. It serves as the base model for TinyLLaVA-Video-R1.

The 16 manually annotated samples used for cold-starting have been released here.

Downloads last month: 10

Safetensors

Model size

4B params

Tensor type

BF16

Inference Providers NEW

Video-Text-to-Text

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including Zhang199/TinyLLaVA-Video-Coldstart_NextQA_16

TinyLLaVA-Video-R1

Collection

Towards Smaller LMMs for Video Reasoning. • 4 items • Updated Apr 15, 2025 • 1

Paper for Zhang199/TinyLLaVA-Video-Coldstart_NextQA_16

TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning

Paper • 2504.09641 • Published Apr 13, 2025 • 16