Speed. Seed?

#30

by CireRetsal - opened Apr 25

Apr 25

It would be nice if the audio were time-stretched when the speed is decreased, so the pitch is preserved. Also, I didn't see anywhere to set the seed for speaker consistency; the voice is very random. I like it though. Thanks. Looking forward to updates.

JoshJarabek

Apr 28

Did you figure out the seed?

JoshJarabek

Apr 28

edited Apr 28

I figured out the speed of speech: lower the cfg_scale in generate. It currently defaults to 3.0, so it talks really fast. I can't figure out the seed, though.

UPDATE: Never mind, I don't think that's what it does.

devnen

Apr 29

edited Apr 29

For speaker consistency, I was able to set a custom fixed seed in the model.py file (inside the dia subfolder), in the generate function. However, this only gives the same speaker when the input text is identical. For a consistent voice across different texts, voice cloning is the recommended approach. You can check out my full implementation here:
https://github.com/devnen/Dia-TTS-Server
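As a rough illustration of what "setting a fixed seed" usually means, here is a minimal sketch. It assumes the model draws its randomness from the standard Python, NumPy, and PyTorch random number generators; `set_seed` is a hypothetical helper name, not a function from the Dia codebase or from Dia-TTS-Server, and the exact place to call it inside model.py is not shown in this thread.

```python
import random


def set_seed(seed: int) -> None:
    """Seed the common RNGs so repeated runs sample the same values.

    Hypothetical helper: NumPy and PyTorch are seeded only if installed.
    """
    random.seed(seed)

    try:
        import numpy as np
        np.random.seed(seed)
    except ImportError:
        pass  # NumPy not installed; nothing to seed

    try:
        import torch
        torch.manual_seed(seed)  # seeds the CPU generator
        if torch.cuda.is_available():
            torch.cuda.manual_seed_all(seed)  # seeds every CUDA device
    except ImportError:
        pass  # PyTorch not installed; nothing to seed


# Assumed usage: call this right before generation, every time.
set_seed(42)
```

Seeding this way only makes the sampling repeatable for the same input. A different text leads to a different sequence of sampled tokens, which is consistent with the speaker changing when the text changes.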

For adjusting speech speed, cfg_scale doesn’t directly control it. Instead, use the dedicated speed parameter in the API/UI. It applies post-processing that resamples the generated audio while maintaining quality.
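To make the resampling idea concrete, here is a minimal sketch of a speed change done by linear interpolation over raw samples. It is not the Dia-TTS-Server implementation; `change_speed` is a hypothetical name, and a real pipeline would more likely use an audio library's resampler.

```python
def change_speed(samples: list[float], speed: float) -> list[float]:
    """Resample mono audio so it plays `speed` times faster (or slower if < 1).

    Uses plain linear interpolation between neighboring samples.
    """
    if speed <= 0:
        raise ValueError("speed must be positive")
    n = len(samples)
    if n == 0:
        return []

    n_out = max(1, round(n / speed))  # fewer samples means shorter, faster audio
    out = []
    for i in range(n_out):
        pos = i * speed               # fractional position in the input
        lo = min(int(pos), n - 1)
        hi = min(lo + 1, n - 1)       # clamp at the last sample
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


slower = change_speed([0.0, 1.0, 2.0, 3.0], 0.5)  # twice as many samples
```

Note that plain resampling played back at the original sample rate shifts the pitch along with the speed. Keeping the pitch, as asked in the first post, requires a time-stretching method such as a phase vocoder instead.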

CireRetsal

Apr 29

Oh dang!!!! I just peeked at it. I'm gonna try this tonight. Thank you.

CireRetsal changed discussion status to closed Apr 29
