Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

tsinghua-ee
/
SALMONN

Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
Model card Files Files and versions
xet
Community
2
SALMONN / resource /response_demo
Ctrl+K
Ctrl+K
  • 5 contributors
History: 3 commits
Changli's picture
Changli
chore: release v1
0bf5005 almost 2 years ago
  • aac.png
    13 kB
    chore: release v1 almost 2 years ago
  • aed.png
    18.6 kB
    chore: release v1 almost 2 years ago
  • asr.png
    13.8 kB
    chore: release v1 almost 2 years ago
  • emo.png
    11.4 kB
    chore: release v1 almost 2 years ago
  • jsac.png
    21 kB
    chore: release v1 almost 2 years ago
  • lyrics.png
    40.7 kB
    chore: release v1 almost 2 years ago
  • mc.png
    28.8 kB
    chore: release v1 almost 2 years ago
  • memo.png
    32.3 kB
    chore: release v1 almost 2 years ago
  • pr.png
    14.8 kB
    chore: release v1 almost 2 years ago
  • sac.png
    29.1 kB
    chore: release v1 almost 2 years ago
  • sq.png
    22.5 kB
    chore: release v1 almost 2 years ago
  • sr.png
    15.9 kB
    chore: release v1 almost 2 years ago
  • story.png
    71.1 kB
    chore: release v1 almost 2 years ago
  • title.png
    27.3 kB
    chore: release v1 almost 2 years ago