Describing short clips

#3
by fishcakeday - opened

When I try to ask the model to describe a short clip with very few frames, it always fails to identify any actions or movements, only talking about the overall description. Trying it with and without do_sample makes no difference. Any way I can use this setup to describe 2-5 second clips?

OpenGVLab org
  1. Have you tried the 7B model?
  2. What's your prompt?
  3. Is this true for all short videos, or is it just a case? You can send corresponding videos to my email address [email protected]

Sign up or log in to comment