Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/
Yifan Peng
pyf98
AI & ML interests
Multimodal LLMs, Speech-to-Speech, Speech Recognition
Recent Activity
new activity
12 days ago
espnet/yodas_owsmv4:When data will be published?
updated
a dataset
2 months ago
espnet/yodas_owsmv4
updated
a dataset
2 months ago
espnet/yodas_owsmv4