DaoanZhang

DwanZhang

AI & ML interests

None yet

Recent Activity

upvoted a collection 25 days ago

DeepSeek-V4

4 items • Updated 25 days ago • 646

upvoted a paper 5 months ago

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

Paper • 2512.22905 • Published Dec 28, 2025 • 20

New activity in guyuchao/Mira 5 months ago

It seems that it need password to decrypt this dataset?

#1 opened 8 months ago by

upvoted a paper 5 months ago

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

Paper • 2512.16915 • Published Dec 18, 2025 • 38

liked a dataset 5 months ago

Zheyuan14/VideoAds

Viewer • Updated Nov 10, 2025 • 1.2k • 58 • 7

upvoted 2 papers 5 months ago

OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification

Paper • 2512.10756 • Published Dec 11, 2025 • 35

Unified Video Editing with Temporal Reasoner

Paper • 2512.07469 • Published Dec 8, 2025 • 46

updated a model 5 months ago

onlinequery/users_query

Updated Dec 9, 2025

published a model 5 months ago

onlinequery/users_query

Updated Dec 9, 2025

liked a dataset 6 months ago

LeeLi4704/VEU-Bench

Preview • Updated Jun 14, 2025 • 204 • 8

upvoted 3 papers 6 months ago

UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios

Paper • 2511.18050 • Published Nov 22, 2025 • 38

VIDEOP2R: Video Understanding from Perception to Reasoning

Paper • 2511.11113 • Published Nov 14, 2025 • 112

UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist

Paper • 2511.08521 • Published Nov 11, 2025 • 39

published a dataset 10 months ago

worldrl/Uni-Janus

Viewer • Updated Aug 9, 2025 • 19.2k • 233

upvoted a paper 10 months ago

MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge

Paper • 2507.21183 • Published Jul 27, 2025 • 15

updated a dataset 10 months ago

Proactive-lmm-2/video_2

Updated Jul 26, 2025 • 5

published a dataset 10 months ago

Proactive-lmm-2/video_2

Updated Jul 26, 2025 • 5

updated a dataset 10 months ago

DwanZhang/useless_store

Updated Jul 9, 2025 • 5

New activity in rghermi/sf20k 11 months ago

Request for Alternative Access to SF20K Videos for Academic Research

#2 opened 12 months ago by

upvoted a paper 12 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 309