You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The error may be caused by false alignment. Please check the textgrid file that "sp", "spn", "sil" are not empty or "". Duration and spec length should be matched so that model can converge.
I found some textGrid files does not have sil(sil, sp, spn) , but other files have . I used mfa tool and use the token "english_us_arpa english_us_arpa" as model. why the generated TextGrid files different?
I have added a check for empty silent phones. Update to the latest code, and reprocessed the dataset to see if there are any remaining issues. Hope this can help you.
dataset.py check:
assert abs(codes.size(-1) - sum(duration)) < 3, (codes.size(-1), sum(duration), filename)
assert abs(audio.shape[1]-lmin * self.hop_length) < 3 * self.hop_length
why to check the encode and duration?
The text was updated successfully, but these errors were encountered: