Xiaoyi Zhang's picture

3

Xiaoyi Zhang

xyzhang626

·

AI & ML interests

None yet

Recent Activity

commented on a paper 10 days ago

Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding

new activity about 1 year ago

microsoft/Phi-3-vision-128k-instruct:Should Phi-3V provide support in llama.cpp?

authored a paper about 1 year ago

Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators

View all activity

Organizations

None yet

xyzhang626's activity

commented a paper 10 days ago

Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding

Paper • 2505.18079 • Published 22 days ago • 4 •

New activity in microsoft/Phi-3-vision-128k-instruct about 1 year ago

Should Phi-3V provide support in llama.cpp?

#24 opened about 1 year ago by

authored 4 papers about 1 year ago

Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators

Paper • 2306.01242 • Published Jun 2, 2023 • 2

Unifying Layout Generation with a Decoupled Diffusion Model

Paper • 2303.05049 • Published Mar 9, 2023

Understanding Mobile GUI: from Pixel-Words to Screen-Sentences

Paper • 2105.11941 • Published May 25, 2021

Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API

Paper • 2310.04716 • Published Oct 7, 2023

updated 2 models about 1 year ago

xyzhang626/vit-mae-base-patch16-256

Image Feature Extraction • Updated Mar 14, 2024 • 11

xyzhang626/vit-mae-large-patch16-256

Image Feature Extraction • Updated Mar 14, 2024 • 20

updated 2 models over 1 year ago

xyzhang626/dinov2-large-patch16-256

Image Feature Extraction • Updated Mar 13, 2024 • 40

xyzhang626/dinov2-base-patch16-256

Image Feature Extraction • Updated Mar 12, 2024 • 19

New activity in madebyollin/sdxl-vae-fp16-fix over 1 year ago

Curious about the methodology of finetuning

#15 opened over 1 year ago by

Curious about the methodology of finetuning

#15 opened over 1 year ago by