I believe the model works best when the text is processed sentence by sentence

#2
by Aid3445 - opened

This truly is a revolutionary model, and I'm really impressed with it. I think it would benefit from auto-chunking longer texts; otherwise the model breaks down and stops pronouncing words accurately.
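For anyone wanting to try this client-side in the meantime, chunking can be done before calling the model, one call per chunk. This is just a sketch: the `split_sentences` helper and `max_chars` limit are made up for illustration, not part of the model's API, and the naive regex split will mishandle abbreviations like "Dr.".

```python
import re

def split_sentences(text: str, max_chars: int = 300) -> list[str]:
    """Split text into sentence-sized chunks for one-at-a-time TTS calls."""
    # Naive boundary: split after ., !, or ? followed by whitespace.
    sentences = re.split(r'(?<=[.!?])\s+', text.strip())
    chunks, current = [], ""
    for s in sentences:
        # Start a new chunk if adding this sentence would exceed the limit.
        if current and len(current) + len(s) + 1 > max_chars:
            chunks.append(current)
            current = s
        else:
            current = f"{current} {s}".strip()
    if current:
        chunks.append(current)
    return chunks
```

You would then feed each chunk to the model separately and concatenate the resulting audio, accepting that voice consistency across chunks isn't guaranteed (which is the issue the maintainers mention below).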

OpenBMB org

Thanks for the suggestion. We actually tried that approach before, but we found the results weren't great. The resulting audio didn't sound very consistent, so we decided to remove that feature for this version.

So what's the best way to generate long/infinite sentences/paragraphs if it doesn't auto-chunk?
Also what about streaming?

There's currently no great way to control consistency across multiple generations. You might try increasing the CFG value to see if that helps; we plan to improve this in a future update.
For streaming, the latest inference code has been merged into the repository. You can now clone the repo to use it.
