tsinghua-ee/SALMONN
Electronic Engineering @Tsinghua University
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv: 2310.13289
arxiv: 2406.15704
License: apache-2.0
SALMONN/resource/response_demo (main)
5 contributors, History: 3 commits
Changli, chore: release v1 (0bf5005), almost 2 years ago
File         Size      Last commit          Updated
aac.png      13 kB     chore: release v1    almost 2 years ago
aed.png      18.6 kB   chore: release v1    almost 2 years ago
asr.png      13.8 kB   chore: release v1    almost 2 years ago
emo.png      11.4 kB   chore: release v1    almost 2 years ago
jsac.png     21 kB     chore: release v1    almost 2 years ago
lyrics.png   40.7 kB   chore: release v1    almost 2 years ago
mc.png       28.8 kB   chore: release v1    almost 2 years ago
memo.png     32.3 kB   chore: release v1    almost 2 years ago
pr.png       14.8 kB   chore: release v1    almost 2 years ago
sac.png      29.1 kB   chore: release v1    almost 2 years ago
sq.png       22.5 kB   chore: release v1    almost 2 years ago
sr.png       15.9 kB   chore: release v1    almost 2 years ago
story.png    71.1 kB   chore: release v1    almost 2 years ago
title.png    27.3 kB   chore: release v1    almost 2 years ago