Different Sample Rate (With retraining)

Hi,

Apologies if this is a silly question. If I train a model on a dataset made at a different sample rate, will this technique still work? E.g. the training data would be normal speech/singing at 40 kHz, paired with time-synced output from a 40 kHz RVC model.

Without changing anything internal to the LLVC model, can I use a different sample rate (given that I've made a dataset at 40 kHz, for instance)?

(Would changing the SR in the config actually do anything to the model?)
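To make the config question concrete, here is a rough sketch of what I'm worried about: if the model works on a fixed number of samples per chunk, the same chunk covers less time at a higher sample rate. The chunk length below is a made-up illustrative value, not taken from the LLVC config, and 16 kHz as the original rate is my assumption.

```python
def window_ms(num_samples: int, sample_rate_hz: int) -> float:
    """Duration in milliseconds covered by a fixed number of samples."""
    return 1000.0 * num_samples / sample_rate_hz


# Hypothetical chunk length in samples; NOT read from the LLVC config.
CHUNK_SAMPLES = 320

# 16 kHz is an assumed baseline rate; 40 kHz is the rate of my dataset.
for sr in (16_000, 40_000):
    print(f"{sr} Hz: {CHUNK_SAMPLES} samples = {window_ms(CHUNK_SAMPLES, sr):.1f} ms")
```

So if only the SR value changes and the sample-based sizes stay the same, each chunk would span a shorter stretch of audio.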

I think the paper said 3 days on a decent GPU; I'm guessing training would take longer at a higher sample rate.
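My back-of-envelope reasoning for that guess, as a sketch: the same audio has proportionally more samples at a higher rate, so if training cost scaled linearly with sample count (a naive assumption on my part, as is the 16 kHz baseline), the estimate would scale by the ratio of the two rates. The function names here are just for illustration.

```python
def relative_sample_count(new_sr_hz: int, base_sr_hz: int) -> float:
    """How many times more samples the same audio has at the new rate."""
    return new_sr_hz / base_sr_hz


def naive_training_days(base_days: float, new_sr_hz: int, base_sr_hz: int) -> float:
    """Naive estimate: assumes cost grows linearly with the number of samples."""
    return base_days * relative_sample_count(new_sr_hz, base_sr_hz)


# Assumed baseline: 3 days at 16 kHz; target: 40 kHz.
print(relative_sample_count(40_000, 16_000))     # 2.5
print(naive_training_days(3.0, 40_000, 16_000))  # 7.5
```

This ignores anything architecture-specific, so I'd treat it as a rough guess rather than a real estimate.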

I'm also intrigued by the paper's mention of fine-tuning to speaker identities. Is it always 3 days of training, or does fine-tuning to a custom voice take less time once you have a base pretrained model?

Thank you

Different Sample Rate (With retraining) #6