19 28 24

Zhang Yuanhan

ZhangYuanhan

https://zhangyuanhan-ai.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

upvoted a paper 6 days ago

POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion

liked a dataset about 1 month ago

lmms-lab/video-tt

View all activity

Organizations

upvoted a paper 5 days ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published 9 days ago • 76

upvoted a paper 6 days ago

POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion

Paper • 2509.01215 • Published 8 days ago • 45

liked a dataset about 1 month ago

lmms-lab/video-tt

Viewer • Updated Jul 26 • 10k • 605 • 4

updated a dataset about 2 months ago

lmms-lab/video-tt

Viewer • Updated Jul 26 • 10k • 605 • 4

upvoted 2 papers about 2 months ago

SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search

Paper • 2507.15245 • Published Jul 21 • 11

Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding

Paper • 2507.15028 • Published Jul 20 • 20

authored a paper about 2 months ago

Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding

Paper • 2507.15028 • Published Jul 20 • 20

commented a paper about 2 months ago

Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding

Paper • 2507.15028 • Published Jul 20 • 20 •

upvoted a paper 3 months ago

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25 • 63

upvoted a paper 5 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 287

upvoted a paper 6 months ago

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness

Paper • 2503.21755 • Published Mar 27 • 34

updated a collection 6 months ago

LMM RL

Collection

3 items • Updated Mar 13

upvoted a paper 6 months ago

BIMBA: Selective-Scan Compression for Long-Range Video Question Answering

Paper • 2503.09590 • Published Mar 12 • 3

updated a collection 6 months ago

Vision Language General

Collection

Vision Language General • 7 items • Updated Mar 13

updated a dataset 6 months ago

lmms-lab/AISG_Challenge

Viewer • Updated Mar 11 • 1.5k • 11 • 5

liked a Space 6 months ago

EgoGPT

👁

Analyze video to describe actions and transcribe audio

liked a dataset 6 months ago

lmms-lab/AISG_Challenge

Viewer • Updated Mar 11 • 1.5k • 11 • 5

updated a dataset 6 months ago

lmms-lab/video-tt

Viewer • Updated Jul 26 • 10k • 605 • 4

Zhang Yuanhan

AI & ML interests

Recent Activity

Organizations

ZhangYuanhan's activity

EgoGPT