Haobo Yuan's picture

15 8 4

Haobo Yuan

HarborYuan

·

https://yuanhaobo.me

AI & ML interests

computer vision

Recent Activity

new activity about 1 month ago

General-Level/General-Bench-Openset:Delete AnimalComplexSceneReasoningVideoObjectSegmentation.zip

authored a paper 3 months ago

On Path to Multimodal Generalist: General-Level and General-Bench

upvoted a paper 3 months ago

On Path to Multimodal Generalist: General-Level and General-Bench

View all activity

Organizations

New activity in General-Level/General-Bench-Openset about 1 month ago

Delete AnimalComplexSceneReasoningVideoObjectSegmentation.zip

#12 opened about 1 month ago by

authored a paper 3 months ago

On Path to Multimodal Generalist: General-Level and General-Bench

Paper • 2505.04620 • Published May 7 • 83

upvoted a paper 3 months ago

On Path to Multimodal Generalist: General-Level and General-Bench

Paper • 2505.04620 • Published May 7 • 83

New activity in General-Level/General-Bench-Openset 3 months ago

Delete video/comrehension

#8 opened 3 months ago by

Delete video/comrehension

#9 opened 3 months ago by

New activity in General-Level/General-Bench-Openset 4 months ago

Delete video/comrehension

#5 opened 4 months ago by

upvoted a paper 4 months ago

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published Apr 8 • 63

authored a paper 4 months ago

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published Apr 8 • 63

New activity in ByteDance/Sa2VA-1B 7 months ago

ValueError due to Mismatch in Tensor Shapes when Loading Model

#3 opened 7 months ago by

updated a dataset 7 months ago

Dense-World/Sa2VA-Training

Updated Jan 20 • 382 • 4

liked a dataset 7 months ago

Dense-World/Sa2VA-Training

Updated Jan 20 • 382 • 4

updated 2 models 7 months ago

Dense-World/Sa2VA-26B

26B • Updated Jan 17 • 1

Dense-World/Sa2VA-1B

1B • Updated Jan 17 • 2

published 2 models 7 months ago

Dense-World/Sa2VA-1B

1B • Updated Jan 17 • 2

Dense-World/Sa2VA-26B

26B • Updated Jan 17 • 1

updated a dataset 7 months ago

HarborYuan/omgseg_data

Updated Jan 17 • 75 • 1

New activity in ByteDance/Sa2VA-4B 7 months ago

Issue when running inference with the 4B model

#3 opened 7 months ago by

authored 2 papers 7 months ago

LLAVADI: What Matters For Multimodal Large Language Models Distillation

Paper • 2407.19409 • Published Jul 28, 2024

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Paper • 2501.04001 • Published Jan 7 • 47

upvoted a collection 7 months ago

Sa2VA Model Zoo

Huggingace Model Zoo For Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos By Bytedance Seed CV Research • 4 items • Updated Feb 9 • 37