Yueze Wang

yzwang

https://yuezewang.github.io/

AI & ML interests

Multi-modal

Recent Activity

updated a dataset 13 minutes ago

OmniGen2/X2I2

updated a dataset about 1 hour ago

OmniGen2/X2I2

authored a paper 4 days ago

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published 5 days ago • 67

authored a paper 26 days ago

MomentSeeker: A Comprehensive Benchmark and A Strong Baseline For Moment Retrieval Within Long Videos

Paper • 2502.12558 • Published Feb 18

authored a paper 5 months ago

EVEv2: Improved Baselines for Encoder-Free Vision-Language Models

Paper • 2502.06788 • Published Feb 10 • 13

authored 2 papers 6 months ago

Seeing Clearly, Answering Incorrectly: A Multimodal Robustness Benchmark for Evaluating MLLMs on Leading Questions

Paper • 2406.10638 • Published Jun 15, 2024

MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval

Paper • 2412.14475 • Published Dec 19, 2024 • 55

authored 8 papers 9 months ago

Fine-Grained Visual Prompting

Paper • 2306.04356 • Published Jun 7, 2023

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 95

Generative Multimodal Models are In-Context Learners

Paper • 2312.13286 • Published Dec 20, 2023 • 37

Generative Pretraining in Multimodality

Paper • 2307.05222 • Published Jul 11, 2023 • 22

Efficient Multimodal Learning from Data-centric Perspective

Paper • 2402.11530 • Published Feb 18, 2024 • 1

Unveiling Encoder-Free Vision-Language Models

Paper • 2406.11832 • Published Jun 17, 2024 • 55

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

Paper • 2407.08303 • Published Jul 11, 2024 • 19

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 116