arxiv:2501.05122
Flo Schneider
floschne
AI & ML interests
Large Vision-Language Models, Cross-modal Retrieval
Recent Activity
liked
a model
2 days ago
Gregor/mblip-mt0-xl
liked
a model
2 days ago
WueNLP/centurio_aya
authored
a paper
6 days ago
Why do LLaVA Vision-Language Models Reply to Images in English?
Organizations
models
None public yet
datasets
14
floschne/wismir3
Viewer
•
Updated
•
301k
•
107
floschne/xflickrco_1k
Viewer
•
Updated
•
8k
•
44
•
1
floschne/xflickrco
Viewer
•
Updated
•
16k
•
76
•
1
floschne/xgqa_1k
Viewer
•
Updated
•
8k
•
43
floschne/xvnli
Viewer
•
Updated
•
5.82k
•
36
floschne/xgqa
Viewer
•
Updated
•
77.3k
•
96
floschne/xm3600_1k
Updated
•
106
floschne/xm3600
Updated
•
53
•
5
floschne/m5b_vlod
Viewer
•
Updated
•
1.42k
•
31
floschne/m5b_vgr
Viewer
•
Updated
•
1.43k
•
31