
沈云航 Yunhang Shen PRO

shenyunhang
https://shenyunhang.github.io/
  • shenyunhang

AI & ML interests

None yet

Recent Activity

upvoted a paper about 22 hours ago
A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise
upvoted a paper about 22 hours ago
Woodpecker: Hallucination Correction for Multimodal Large Language Models
upvoted a paper about 22 hours ago
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

Organizations

VITA-MLLM

Papers 6

arxiv:2503.14504
arxiv:2501.01957
arxiv:2408.05211
arxiv:2405.21075

Spaces 3

pinned
Running on Zero
2

VITA-Audio

🚀

Generate text, speech, or interpret audio

4 days ago
Build error
20

APE

🌍

Feb 27
Running on Zero
2

Long VITA

🏃

Long-VITA Demo

Feb 26

Models 1

shenyunhang/APE

Updated Mar 28, 2024 • 7

Datasets 5

shenyunhang/AudioQA-1M

Viewer • Updated 1 day ago • 5.1k • 60

shenyunhang/VoiceAssistant-400K

Updated Mar 23 • 45.7k

shenyunhang/AISHELL-4

Preview • Updated Mar 23 • 5.91k

shenyunhang/AISHELL-3

Preview • Updated Mar 23 • 274

shenyunhang/AISHELL-1

Updated Mar 23 • 3.23k