Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
zhangtao's picture
2 11 2

zhangtao

zhangtao-whu
huathedev's profile picture HarborYuan's profile picture dark-pen's profile picture
·
https://github.com/zhang-tao-whu
  • zhang-tao-whu

AI & ML interests

segmentation

Recent Activity

updated a dataset 2 days ago
zhangtao-whu/sam_tfrecords
updated a dataset 2 days ago
zhangtao-whu/sam_tfrecords
updated a dataset 2 days ago
zhangtao-whu/sam_tfrecords
View all activity

Organizations

Wuhan Univeristy's profile picture Dense World's profile picture Path to Multimodal Generalist's profile picture

authored 6 papers 6 months ago

Point Cloud Mamba: Point Cloud Learning via State Space Model

Paper • 2403.00762 • Published Mar 1, 2024

DVIS++: Improved Decoupled Framework for Universal Video Segmentation

Paper • 2312.13305 • Published Dec 20, 2023

Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs

Paper • 2501.04670 • Published Jan 8

OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding

Paper • 2406.19389 • Published Jun 27, 2024 • 55

DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries

Paper • 2404.00086 • Published Mar 29, 2024

DVIS: Decoupled Video Instance Segmentation Framework

Paper • 2306.03413 • Published Jun 6, 2023
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs