DAMO-NLP-SG/VideoRefer-VideoLLaMA3-7B

Tags: Video-Text-to-Text, videollama3_qwen2, text-generation, multimodal large language model, large video-language model

Repository size: 16.1 GB
Contributors: 2
History: 2 commits

Latest commit: Upload Videollama3Qwen2ForCausalLM (b899a04, verified), by CircleRadon, 3 months ago

File                              Size       Storage  Last commit                         Updated
.gitattributes                    1.52 kB    -        initial commit                      3 months ago
README.md                         5.17 kB    -        Upload Videollama3Qwen2ForCausalLM  3 months ago
config.json                       1.92 kB    -        Upload Videollama3Qwen2ForCausalLM  3 months ago
generation_config.json            243 Bytes  -        Upload Videollama3Qwen2ForCausalLM  3 months ago
model-00001-of-00004.safetensors  4.87 GB    xet      Upload Videollama3Qwen2ForCausalLM  3 months ago
model-00002-of-00004.safetensors  4.93 GB    xet      Upload Videollama3Qwen2ForCausalLM  3 months ago
model-00003-of-00004.safetensors  4.99 GB    xet      Upload Videollama3Qwen2ForCausalLM  3 months ago
model-00004-of-00004.safetensors  1.32 GB    xet      Upload Videollama3Qwen2ForCausalLM  3 months ago
model.safetensors.index.json      84.8 kB    -        Upload Videollama3Qwen2ForCausalLM  3 months ago
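
As a quick sanity check, the per-file sizes in the listing can be summed and compared with the 16.1 GB repository total. The sketch below is illustrative only: the variable names are made up for this example, the sizes are the rounded figures shown in the listing, and it assumes decimal units (1 kB = 1,000 bytes, 1 GB = 10^9 bytes).

```python
# Sizes copied from the file listing (rounded as displayed).
# Assumption: decimal units, i.e. 1 kB = 1e3 bytes and 1 GB = 1e9 bytes.
shard_sizes_gb = {
    "model-00001-of-00004.safetensors": 4.87,
    "model-00002-of-00004.safetensors": 4.93,
    "model-00003-of-00004.safetensors": 4.99,
    "model-00004-of-00004.safetensors": 1.32,
}

small_file_sizes_bytes = {
    ".gitattributes": 1_520,
    "README.md": 5_170,
    "config.json": 1_920,
    "generation_config.json": 243,
    "model.safetensors.index.json": 84_800,
}

# Total in GB: weight shards plus the small metadata files.
total_gb = sum(shard_sizes_gb.values()) + sum(small_file_sizes_bytes.values()) / 1e9

print(f"Weight shards: {len(shard_sizes_gb)}")
print(f"Approximate total: {total_gb:.1f} GB")
```

The four weight shards account for essentially the whole total; the configuration and index files together add well under a megabyte.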