Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models
Jinhui Yi*, Syed Talal Wasim*, Yanan Luo*, Muzammal Naseer, Juergen Gall
*Equal Contribution
University of Bonn; Lamarr Institute for Machine Learning and Artificial Intelligence; Khalifa University
Model Weights
We release the pretrained and instruction-tuned weights of Video-Panda in this repository.
โ๏ธ Citation
If Video-Panda is helpful for your research, please consider star โญ and citation ๐ :
@article{yi2024video-panda,
author = {Jinhui Yi* and Syed Talal Wasim* and Yanan Luo* and Muzammal Naseer and Juergen Gall},
title = {Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models},
journal = {arXiv preprint, arXiv:2412.18609},
year = {2024},
}
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the HF Inference API does not support transformers models with pipeline type video-text-to-text