STS can not assign a specified voice #119

blues4347 · 2024-06-25T07:18:41Z

In the STS tab, whichever uploaded voice I select, the output is always the same one (cn-nan). When I select a voice that comes from the code (such as cn-nan.wav or cn-XiaoyiNeural), the output uses the selected voice.

BobHop · 2024-07-31T20:07:52Z

Hi, same problem here. TTS works very well, but STS does not. Although the console window says it's using the selected input wav file and the cloned voice wav file, ~~the resulting audio doesn't make use of the cloned voice~~. [EDIT 20240801] In fact, it does make use of the cloned voice! But somehow the speech-to-speech process strips out anything distinctive about the cloned voice, making it sound very neutral. You can try it yourself: use speech-to-speech with a recording of yourself imitating an old man, a little girl, Mickey Mouse or something else. The STS generation will work and sound different each time (a bit lower, a bit higher) but won't retain the distinctive traits of your imitation. So I'm not sure where the problem comes from, especially given that TTS generation does work and does retain the distinctive traits of the cloned voice.
