Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
xinyiW915
/
ReLaX-VQA
like
1
Visual Question Answering
5 datasets
deep-learning
vision
VQA
Transformer
CNN
arxiv:
2407.11496
License:
apache-2.0
Model card
Files
Files and versions
Community
main
ReLaX-VQA
Ctrl+K
Ctrl+K
1 contributor
History:
15 commits
Xinyi Wang
update README
045b2f8
3 months ago
metadata
first commit
3 months ago
model
Upload model
3 months ago
src
first commit
3 months ago
ugc_original_videos
first commit
3 months ago
.gitattributes
Safe
1.6 kB
first commit
3 months ago
.gitignore
Safe
78 Bytes
Update
3 months ago
Framework.png
Safe
18.9 MB
LFS
first commit
3 months ago
README.md
9.52 kB
update README
3 months ago
reported_result.ipynb
Safe
66.8 kB
first commit
3 months ago
requirements.txt
Safe
2.57 kB
first commit
3 months ago