Replies: 2 comments
-
I'm no expert, but I tried this too. After about 4000 epochs of training on a hundred 20-second clips, it still sounds really buzzy and is unintelligible. Weirdly, it does sound like the voiced person (pitch, accent, 'modulation'), just not speaking intelligible words. Fine-tuning the lessac model with the same data: perfect. Repeating that with my own data recorded via piper-recording, 100 samples: also fine. My guess is that to train a model to a good working state from scratch, you need a ____load of data.
-
I went to 10,000 epochs and it still wasn't quite right, so I think your theory is spot on: the base models were probably trained on thousands of clips.
-
I'm experimenting with training a brand-new model, not starting from an existing checkpoint.
I have about 130 WAV files.
Are there any recommendations for --validation-split and --num-test-examples?
The docs show both as zero, but that may assume you're fine-tuning from an existing checkpoint.
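For what it's worth, here is how I understand those two options would carve up a small dataset, assuming --validation-split is a fraction of the total and --num-test-examples is an absolute count (that's my reading, not something confirmed in the docs):

```python
# Sketch of how a small dataset might be partitioned, assuming
# --validation-split is a fraction (0.0-1.0) and
# --num-test-examples is an absolute number of clips.
def partition(total, validation_split, num_test_examples):
    n_val = int(total * validation_split)          # clips held out for validation
    n_test = num_test_examples                     # clips held out for testing
    n_train = total - n_val - n_test               # everything else trains
    return n_train, n_val, n_test

# With 130 files, a 5% validation split and 5 test clips
# still leaves 119 clips for training.
print(partition(130, 0.05, 5))  # → (119, 6, 5)
```

With only ~130 clips, even a small split eats a noticeable share of the training data, which may be why the docs default both to zero.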
Thanks,
Rob