Can I use VAD for OpenVoice2? #325

qzyyyy · 2024-10-24T12:36:12Z

I want to clone a voice from audio segments that are shorter than 1.5 seconds after splitting, and I'm wondering if I can use VAD instead of Whisper in OpenVoice2.

dosubot · 2024-10-24T12:37:46Z

@qzyyyy Hello there! I'm here to assist you with any bugs, questions, or contributions while you wait for a human maintainer. Let's tackle this together!

Yes, you can use VAD (Voice Activity Detection) in OpenVoice2. The `get_se` function lets you choose between VAD and Whisper for processing audio. Setting the `vad` parameter to `True` makes the function use VAD to segment the audio. However, keep in mind that OpenVoice2 filters out audio segments shorter than 1.5 seconds, so these segments will not be saved or used for voice cloning [1].
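To make the 1.5-second limit concrete, here is a minimal sketch of that kind of duration filter, plus one possible workaround that merges neighbouring short segments before filtering. This is not OpenVoice code: `keep_long_enough`, `merge_close_segments`, the `max_gap` value, and the example timestamps are all hypothetical names and numbers chosen for illustration. Only the 1.5-second threshold comes from the answer above.

```python
from typing import List, Tuple

# A segment is (start_seconds, end_seconds). Hypothetical representation,
# not the data structure OpenVoice uses internally.
Segment = Tuple[float, float]

MIN_DURATION_S = 1.5  # threshold quoted in the answer above


def keep_long_enough(segments: List[Segment],
                     min_duration: float = MIN_DURATION_S) -> List[Segment]:
    """Drop every segment shorter than min_duration seconds."""
    return [(start, end) for start, end in segments
            if (end - start) >= min_duration]


def merge_close_segments(segments: List[Segment],
                         max_gap: float = 0.3) -> List[Segment]:
    """Join segments separated by at most max_gap seconds of silence.

    max_gap is an assumed tuning value, not something OpenVoice defines.
    """
    merged: List[Segment] = []
    for start, end in sorted(segments):
        if merged and start - merged[-1][1] <= max_gap:
            # Extend the previous segment instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Example speech regions, as a VAD step might report them (made-up values).
segments = [(0.0, 1.0), (1.2, 2.0), (5.0, 5.8), (9.0, 11.0)]

kept_raw = keep_long_enough(segments)
kept_merged = keep_long_enough(merge_close_segments(segments))

print(kept_raw)     # only the 2-second segment survives
print(kept_merged)  # the first two short segments survive once merged
```

With these example values, filtering alone keeps one segment, while merging first keeps two. Whether merging across short pauses is acceptable depends on the recording, so treat it as an option to evaluate rather than a recommendation.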


qzyyyy · 2024-10-24T12:40:43Z

Will VAD affect the performance of OpenVoice2?

dosubot (bot) added the "question" label (Further information is requested) on Oct 24, 2024.
