7 18 7

Zebin You

yyyou

https://yyyouy.github.io/

yyyouy

AI & ML interests

Multimodal learning, generative model

Recent Activity

new activity 1 day ago

GSAI-ML/LLaDA-V:Add library name to model card

new activity 10 days ago

GSAI-ML/LLaDA-V:Add link to paper and pipeline tag

updated a model 10 days ago

GSAI-ML/LLaDA-V

View all activity

Organizations

yyyou's activity

upvoted 2 papers 27 days ago

Scaling Diffusion Transformers Efficiently via μP

Paper • 2505.15270 • Published 29 days ago • 32

LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning

Paper • 2505.16933 • Published 28 days ago • 32

upvoted a paper about 2 months ago

DeepCritic: Deliberate Critique with Large Language Models

Paper • 2505.00662 • Published May 1 • 53

upvoted 4 papers 3 months ago

upvoted a paper 4 months ago

Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Paper • 2501.01423 • Published Jan 2 • 43

upvoted a collection 4 months ago

Preference Datasets for DPO

Collection

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Dec 11, 2024 • 43

upvoted a paper 4 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 145

upvoted a collection 4 months ago

LMMs-Eval

Collection

Dataset Collection of LMMs-Eval • 36 items • Updated Oct 4, 2024 • 29

upvoted a paper 8 months ago

Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models

Paper • 2410.11081 • Published Oct 14, 2024 • 19

upvoted 2 papers 10 months ago

MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data

Paper • 2406.18790 • Published Jun 26, 2024 • 35

Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models

Paper • 2408.04594 • Published Aug 8, 2024 • 15

upvoted an article 12 months ago

Article

🧨 Diffusers welcomes Stable Diffusion 3

Jun 12, 2024

• 96

upvoted a paper 12 months ago

Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges

Paper • 2406.12624 • Published Jun 18, 2024 • 38

upvoted 2 papers over 1 year ago

Describing Differences in Image Sets with Natural Language

Paper • 2312.02974 • Published Dec 5, 2023 • 16

De-Diffusion Makes Text a Strong Cross-Modal Interface

Paper • 2311.00618 • Published Nov 1, 2023 • 23