Pritam Sarkar
pritamqu
AI & ML interests
multimodal learning with vision, language, and audio; generative modeling; large multimodal models (LMMs); multimodal LLMs (MLLMs); AI agents; alignment; representation learning; self-supervised and unsupervised learning; vision-language models; audio-visual models; foundation models; computer vision
Recent Activity
Organizations
None yet
pritamqu's activity
upvoted a paper about 2 months ago