Zeyi Sun

Zery

https://github.com/SunzeY

AI & ML interests

CV

authored a paper 8 months ago

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published Aug 27, 2025 • 37

authored a paper 9 months ago

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Paper • 2508.04700 • Published Aug 6, 2025 • 52

authored 2 papers about 1 year ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published Mar 3, 2025 • 86

RelightVid: Temporal-Consistent Diffusion Model for Video Relighting

Paper • 2501.16330 • Published Jan 27, 2025 • 3

authored 2 papers over 1 year ago

V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results

Paper • 2406.11739 • Published Jun 17, 2024

X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models

Paper • 2412.01824 • Published Dec 2, 2024 • 64

authored 2 papers almost 2 years ago

Bootstrap3D: Improving 3D Content Creation with Synthetic Data

Paper • 2406.00093 • Published May 31, 2024 • 1

Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials

Paper • 2404.16829 • Published Apr 25, 2024 • 4

authored 3 papers over 2 years ago

Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases

Paper • 2312.15011 • Published Dec 22, 2023 • 18

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Paper • 2312.03818 • Published Dec 6, 2023 • 34

GPT4Point: A Unified Framework for Point-Language Understanding and Generation

Paper • 2312.02980 • Published Dec 5, 2023 • 9