Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/
Yifan Peng
pyf98
AI & ML interests
Speech Processing, Speech Recognition, Spoken Language Processing
Recent Activity
upvoted
a
collection
about 20 hours ago
OLMo 2
authored
a paper
25 days ago
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet
authored
a paper
25 days ago
On the Effects of Heterogeneous Data Sources on Speech-to-Text
Foundation Models