Spaces:

TTS-AGI
/

TTS-Arena

Running on CPU Upgrade

PlayDialog 1.0

#83

by legofan94 - opened 11 days ago

11 days ago

We've released PlayDialog 1.0, a huge rework since Dialog Beta (which is currently on the leaderboard). Could we add in Dialog 1.0?

https://x.com/play_ht/status/1887967775207121021

mrfakename

TTS AGI org 11 days ago

Hey,
Thanks for reaching out! We can definitely upgrade the model to v1.0. Has the PlayDialog-http model automatically been upgraded from Beta to v1.0 in the Python API?

legofan94

9 days ago

Is it possible to relaunch it as a new model? Are you able to share the params you are currently passing through and the endpoint you are hitting?

bryananderson

9 days ago

•

edited 9 days ago

If you're using our Python SDK (pyht), I recommend upgrading to the latest version and then calling tts() with voice_engine='PlayDialog' and protocol='http' instead of voice_engine='PlayDialog-http' as we changed the API to separate them. Otherwise it will be the same.

mrfakename

TTS AGI org 9 days ago

Is it possible to relaunch it as a new model? Are you able to share the params you are currently passing through and the endpoint you are hitting?

Yes, we can relaunch as a new model. Should it be labelled as PlayDialog 1.0?

The current params being used:

for chunk in play_client.tts(text, TTSOptions(voice="s3://voice-cloning-zero-shot/831bd330-85c6-4333-b2b4-10c476ea3491/original/manifest.json"), voice_engine="PlayDialog-http"):

legofan94

8 days ago

That is the correct labeling! Will wait for @bryananderson to confirm -- but a question for you @mrfakename is how do you pick voices / do you cycle through them? Do you only pick one? Should we provide you a list?

legofan94

2 days ago

Bumping @mrfakename

Can you share the params for the python sdk payload?

mrfakename

TTS AGI org 1 day ago

•

edited 1 day ago

Hey @legofan94 ,
So sorry about the delay. Here's the code used for generation:

voice_engine = "PlayDialog"
tts_options = TTSOptions(voice="s3://voice-cloning-zero-shot/831bd330-85c6-4333-b2b4-10c476ea3491/original/manifest.json")
for chunk in play_client.tts(text, tts_options, voice_engine=voice_engine):
    if chunk == b'':
        play_client.close()
        break
    f.write(chunk)
    return f.name, None

Working on getting it added now! Are these the right params to use?

legofan94

1 day ago

Almost! Can you use this voice instead of the one you have in params? It's a good neutral voice if you're only taking one. If you want to have different accents and speaker styles we can provide more.

@mrfakename

s3://voice-cloning-zero-shot/42c41808-0ddb-4674-8965-024a52ad6c8e/original/manifest.json

mrfakename

TTS AGI org 1 day ago

Makes sense, switched to that voice. Are there any other settings that need to be adjusted before launch?

legofan94

1 day ago

Nope -- you're otherwise good!

mrfakename

TTS AGI org 1 day ago

•

edited 1 day ago

Should be live shortly!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment