You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you very much for building this tool. I want to use it to segment/align libri-light for training TTS. I am new to this tool. Can anyone help me with the following questions:
If I want to segment the books to about 10 sec chunks (rather than 30), what hyperparameters I should change?
In the output, there are two sets of texts, lowercase with punctuations, and uppercase without punctuations, which one should I use as the ground truth for training TTS?
Thank you so much for any help.
The text was updated successfully, but these errors were encountered:
In the output, there are two sets of texts, lowercase with punctuations, and uppercase without punctuations, which one should I use as the ground truth for training TTS?
It's up to you, I will suggest to use texts with punctuations.
@pkufool Thank you so much. From a quick look at the documentation, it looks to me that the texts with punctuations are the reference, and the uppercase ones are the output from ASR. I am wondering is the uppercase text equally accurate as the reference, if I want to use them to train TTS?
No. If you don't want the punctuations, you can remove them and convert the punctuation texts to uppercase, it is not a good idea to use the ASR transcrptions to train TTS.
Hi,
Thank you very much for building this tool. I want to use it to segment/align libri-light for training TTS. I am new to this tool. Can anyone help me with the following questions:
Thank you so much for any help.
The text was updated successfully, but these errors were encountered: