STS can not assign a specified voice #119

Open
blues4347 opened this issue Jun 25, 2024 · 1 comment

Comments

blues4347 commented Jun 25, 2024

In the STS tab, whenever I select a voice that I uploaded myself, the output always uses the same voice (cn-nan). When I select one of the voices that come with the code (such as cn-nan.wav or cn-XiaoyiNeural), the output uses the selected voice.

BobHop commented Jul 31, 2024

Hi, same problem here. TTS works very well, but STS does not. Although the console window says it's using the selected input wav file and the cloned voice wav file, the resulting audio doesn't seem to make use of the cloned voice.

[EDIT 20240801] In fact, it does make use of the cloned voice! But somehow the speech-to-speech process removes anything distinctive about the cloned voice, making it sound very neutral. You can try it yourself: use speech-to-speech with a recording of yourself imitating an old man, a little girl, Mickey Mouse or something else. The STS generation will work and sound different each time (a bit lower, a bit higher), but it won't retain the distinctive traits of your imitation. So I'm not sure where the problem comes from, especially since TTS generation does work and does retain the distinctive traits of the cloned voice.
