Shawon Ashraf
shawon
AI & ML interests
Multi-Modal NLP, LLM and RAG
Recent Activity
liked
a model
1 day ago
google/medgemma-4b-pt
liked
a Space
1 day ago
enzostvs/deepsite
liked
a model
1 day ago
mistralai/Devstral-Small-2505
Organizations
Collections
6
-
PointArena: Probing Multimodal Grounding Through Language-Guided Pointing
Paper • 2505.09990 • Published • 11 -
Style Customization of Text-to-Vector Generation with Image Diffusion Priors
Paper • 2505.10558 • Published • 15 -
Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
Paper • 2505.10046 • Published • 9 -
X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real
Paper • 2505.07096 • Published • 3