Generate detailed music descriptions from audio clips
Identify sound sources in images using audio
西北工业大学ASLP实验室OSUM项目demo展示
Blazingly Fast and Embarrassingly Simple Song Generation