David Junhao ZHANG

Junhao233

Recent Activity

upvoted 2 papers 6 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 159

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25, 2025 • 64

upvoted a paper 7 months ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19, 2025 • 130

upvoted 4 papers 10 months ago

upvoted 3 papers about 1 year ago

Causal Diffusion Transformers for Generative Modeling

Paper • 2412.12095 • Published Dec 16, 2024 • 23

ROICtrl: Boosting Instance Control for Visual Generation

Paper • 2411.17949 • Published Nov 27, 2024 • 87

ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning

Paper • 2411.05003 • Published Nov 7, 2024 • 71

liked a Space about 1 year ago

Diffusers Image Outpaint

Space • 2.46k • Easily expand image boundaries

upvoted a paper about 1 year ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 75

authored a paper about 1 year ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 75

upvoted a paper over 1 year ago

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22, 2024 • 51

liked a model almost 2 years ago

showlab/show-1-sr2

Text-to-Video • Updated Oct 12, 2023 • 2.56k • 10

authored 2 papers almost 2 years ago

DragAnything: Motion Control for Anything using Entity Representation

Paper • 2403.07420 • Published Mar 12, 2024 • 14

Towards A Better Metric for Text-to-Video Generation

Paper • 2401.07781 • Published Jan 15, 2024 • 15

upvoted a paper almost 2 years ago

Towards A Better Metric for Text-to-Video Generation

Paper • 2401.07781 • Published Jan 15, 2024 • 15

upvoted a paper about 2 years ago

Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions

Paper • 2401.01827 • Published Jan 3, 2024 • 18

liked a Space about 2 years ago

MotionDirector
