Want to share my training set
Hey,
I am from Leipzig and I created around 12,000 German samples with ElevenLabs. I would like to share this dataset (wav + txt) with you. I am also able to create more with specific texts, such as mathematical formulas and so on. Furthermore, I trained my model on this set on my RTX 5090 for around 5 days and still had some problems with word order, but the words themselves sound German enough to me. I then merged it with your model and was satisfied :D
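For anyone who wants to sanity-check a wav + txt set like this before training, here is a minimal sketch. It assumes every `clip.wav` has a transcript `clip.txt` next to it in one flat folder; that layout and the function name `summarize_dataset` are assumptions for illustration, not a description of how the dataset is actually organized. It uses only Python's standard `wave` module, which reads uncompressed PCM WAV files.

```python
import wave
from pathlib import Path


def summarize_dataset(root):
    """Return (durations, missing) for a flat folder of wav + txt pairs.

    durations maps each wav file name to its length in seconds;
    missing lists wav files that have no matching .txt transcript.
    The flat layout is an assumption, not a property of the real dataset.
    """
    durations = {}
    missing = []
    for wav_path in sorted(Path(root).glob("*.wav")):
        # A sample is only usable for training if its transcript exists.
        if not wav_path.with_suffix(".txt").exists():
            missing.append(wav_path.name)
            continue
        with wave.open(str(wav_path), "rb") as wav_file:
            # Duration in seconds = number of frames / frames per second.
            durations[wav_path.name] = wav_file.getnframes() / wav_file.getframerate()
    return durations, missing
```

Calling `summarize_dataset("my_dataset")` (a hypothetical folder name) gives the per-file lengths, from which the shortest and longest samples can be read off with `min(durations.values())` and `max(durations.values())`.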
Hey, that would be super helpful 😊. I am currently preparing a bigger dataset.
How long are the audio files? And how could you share the dataset?
Hello,
I have contacted you via LinkedIn. The samples vary in length: I generated everything from single words like “Hello!” to samples of up to 2 minutes in some cases. I wasn't sure at first how long the samples should be. I trained my model for almost 7 days on my 5090, around 1 million passes. I don't know if the weights are of any use to you? I can send them to you if you like. I found my model merged with yours at 65% quite good. Best regards.
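For readers wondering what a 65% merge could look like in practice, here is a minimal sketch. It assumes the merge is a plain linear interpolation of two checkpoints with identical parameter names; the thread does not say how the merge was actually done or which model received the 65% share, so both the method and the `share_a` parameter are assumptions. The function `merge_weights` is hypothetical and works on any values that support `*` and `+`, such as floats, NumPy arrays, or PyTorch tensors.

```python
def merge_weights(weights_a, weights_b, share_a=0.65):
    """Linearly interpolate two checkpoints with the same parameter names.

    share_a is the fraction taken from weights_a; the remainder comes
    from weights_b. Which model gets 65% is an assumption here.
    """
    if weights_a.keys() != weights_b.keys():
        raise ValueError("checkpoints have different parameter names")
    return {
        name: share_a * weights_a[name] + (1.0 - share_a) * weights_b[name]
        for name in weights_a
    }
```

With PyTorch, the two inputs would typically be the dictionaries returned by each model's `state_dict()`, and the result could be loaded back with `load_state_dict`. Non-float entries such as integer counters would need separate handling.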