I want to clone a voice from audio segments that are shorter than 1.5 seconds after splitting, and I'm wondering if I can use VAD instead of Whisper in OpenVoice2.
@qzyyyy Hello there! I'm here to assist you with any bugs, questions, or contributions while you wait for a human maintainer. Let's tackle this together!
Yes, you can use VAD (Voice Activity Detection) in OpenVoice2. The get_se function lets you choose between VAD and Whisper for splitting the audio: setting the vad parameter to True makes it segment the audio with VAD instead of Whisper. Keep in mind, however, that OpenVoice2 filters out audio segments shorter than 1.5 seconds, so those segments are not saved or used for voice cloning [1].
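To make the effect of that 1.5-second cutoff concrete, here is a minimal, self-contained sketch of a duration filter. It is not OpenVoice2's actual code: the function name filter_short_segments, the (start, end) tuple layout, and the example timestamps are all hypothetical, and the only value taken from the answer above is the 1.5-second threshold.

```python
from typing import List, Tuple

# Threshold quoted in the answer above; everything else here is illustrative.
MIN_SEGMENT_SECONDS = 1.5

# A segment is a hypothetical (start, end) pair in seconds.
Segment = Tuple[float, float]


def filter_short_segments(
    segments: List[Segment],
    min_seconds: float = MIN_SEGMENT_SECONDS,
) -> List[Segment]:
    """Keep only segments whose duration is at least min_seconds."""
    return [(start, end) for start, end in segments if (end - start) >= min_seconds]


if __name__ == "__main__":
    # Made-up split points: durations are 1.0 s, 1.8 s, and 1.5 s.
    example = [(0.0, 1.0), (1.2, 3.0), (3.5, 5.0)]
    kept = filter_short_segments(example)
    dropped = [seg for seg in example if seg not in kept]
    print("kept:", kept)
    print("dropped:", dropped)
```

If every segment from your recording falls under the threshold, a filter like this leaves nothing to build the speaker embedding from, which is why very short clips are a problem regardless of whether VAD or Whisper does the splitting.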