-
Notifications
You must be signed in to change notification settings - Fork 42
Description
Hi,
Apologies if this is a silly question. If I train a model with a dataset made from a different Sample rate; will this technique still work? eg the training data would come from normal speech/singing @40kHz, and time synced pairs of response from a 40khz RVC model.
Without changing anything internal to the LLVC model, can I use a different Sample Rate? (granted that I've made a dataset at 40khz for instance)
(Would changing the SR in config actually do anything to the model?)
I think the paper said 3 days on a decent GPU, I'm guessing training time would be more for higher sample rate.
Also I'm intrigued about the paper's mention of fine tuning to speaker identities. Whether it's always 3 days training, or once you have a base pretrained model, the fine tuning to custom voice is less time.
Thank you