arxiv:2501.05452
Xingyu Fu
Fiaa
AI & ML interests
NLP, multimodal
Recent Activity
liked
a model
1 day ago
stabilityai/stable-video-diffusion-img2vid-xt
authored
a paper
5 days ago
ReFocus: Visual Editing as a Chain of Thought for Structured Image
Understanding
upvoted
a
paper
5 days ago
ReFocus: Visual Editing as a Chain of Thought for Structured Image
Understanding