DAMO-NLP-SG/VideoRefer-VideoLLaMA3-2B
Language Technology Lab at Alibaba DAMO Academy
Tags: Video-Text-to-Text, Transformers, Safetensors, English, videollama3_qwen2, text-generation, multimodal large language model, large video-language model
arxiv: 4 papers
License: apache-2.0
Revision: refs/pr/1
2 contributors
History: 7 commits
Latest commit: merve (HF Staff), "Fix task tag", 617eb6a (verified), 11 days ago
File                     Scan  Size           Last commit message                 Updated
.gitattributes           Safe  1.52 kB        initial commit                      13 days ago
README.md                Safe  5.02 kB        Fix task tag                        11 days ago
added_tokens.json        Safe  706 Bytes      Upload tokenizer                    13 days ago
config.json              Safe  1.69 kB        Update config.json                  13 days ago
generation_config.json   Safe  242 Bytes      Upload Videollama3Qwen2ForCausalLM  13 days ago
merges.txt               Safe  1.67 MB        Upload tokenizer                    13 days ago
model.safetensors        Safe  3.93 GB (LFS)  Upload Videollama3Qwen2ForCausalLM  13 days ago
special_tokens_map.json  Safe  613 Bytes      Upload tokenizer                    13 days ago
tokenizer_config.json    Safe  7.05 kB        Upload tokenizer                    13 days ago
vocab.json               Safe  3.38 MB        Upload tokenizer                    13 days ago