This collection includes the implementation of models described in Vox-Profile benchmark. (https://arxiv.org/pdf/2505.14648)
-
tiantiaf/whisper-large-v3-msp-podcast-emotion
Audio Classification • 2B • Updated • 306 • 2 -
tiantiaf/whisper-large-v3-narrow-accent
Audio Classification • 2B • Updated • 160 • 2 -
tiantiaf/wavlm-large-broader-accent
Audio Classification • Updated • 46 • 2 -
tiantiaf/whisper-large-v3-speech-flow
Audio Classification • 2B • Updated • 146 • 1