How many hours of speech data was this model trained on?

by stefanr123 - opened 1 day ago

1 day ago

Just out of interest, how many hours do the 60,000 utterances amount to, and was that the sole training data, or had the base model already been trained on other speech? I'm trying to understand how much data it would take to train a model of this quality in another language.

bharathkumar1922001

Maya Research org about 7 hours ago

The 60k utterances are merely for an adaptor fix. The base was trained on much more data; we had to remove some speaker embeddings and focus on these speakers, so the 60k voice samples are what the open-source adaptor was re-trained on. The base has more knowledge.

stefanr123

about 4 hours ago

Thank you! Can you share a rough ballpark for data used for the base model?
