Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can I use VAD for OpenVoice2? #325

Open
qzyyyy opened this issue Oct 24, 2024 · 2 comments
Open

Can I use VAD for OpenVoice2? #325

qzyyyy opened this issue Oct 24, 2024 · 2 comments
Labels
question Further information is requested

Comments

@qzyyyy
Copy link

qzyyyy commented Oct 24, 2024

I want to clone a voice from audio segments that are shorter than 1.5 seconds after splitting, and I'm wondering if I can use VAD instead of Whisper in OpenVoice2.

Copy link

dosubot bot commented Oct 24, 2024

@qzyyyy Hello there! I'm here to assist you with any bugs, questions, or contributions while you wait for a human maintainer. Let's tackle this together!

Yes, you can use VAD (Voice Activity Detection) in OpenVoice2. The get_se function allows you to choose between VAD and Whisper for processing audio. By setting the vad parameter to True, the function will use VAD to segment the audio. However, keep in mind that OpenVoice2 filters out audio segments shorter than 1.5 seconds, so these segments will not be saved or used for voice cloning [1].

To continue talking to Dosu, mention @dosu.

@dosubot dosubot bot added the question Further information is requested label Oct 24, 2024
@qzyyyy
Copy link
Author

qzyyyy commented Oct 24, 2024

Will VAD effect the performance of OpenVoice2?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

1 participant