Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
DAMO-NLP-SG
/
VideoRefer-VideoLLaMA3-2B
like
4
Follow
Language Technology Lab at Alibaba DAMO Academy
133
Video-Text-to-Text
Transformers
Safetensors
English
videollama3_qwen2
text-generation
multimodal large language model
large video-language model
arxiv:
4 papers
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
3438a12
VideoRefer-VideoLLaMA3-2B
Ctrl+K
Ctrl+K
2 contributors
History:
2 commits
CircleRadon
Upload Videollama3Qwen2ForCausalLM
3438a12
verified
14 days ago
.gitattributes
Safe
1.52 kB
initial commit
14 days ago
README.md
Safe
5.17 kB
Upload Videollama3Qwen2ForCausalLM
14 days ago
config.json
1.76 kB
Upload Videollama3Qwen2ForCausalLM
14 days ago
generation_config.json
Safe
242 Bytes
Upload Videollama3Qwen2ForCausalLM
14 days ago
model.safetensors
Safe
3.93 GB
LFS
Upload Videollama3Qwen2ForCausalLM
14 days ago