Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

dataset.py : codes.size(-1)= 48960, sum(duration)=38678 assert error: #15

Open
lileishitou opened this issue Aug 2, 2023 · 4 comments

Comments

@lileishitou
Copy link

dataset.py check:

assert abs(codes.size(-1) - sum(duration)) < 3, (codes.size(-1), sum(duration), filename)
assert abs(audio.shape[1]-lmin * self.hop_length) < 3 * self.hop_length

why to check the encode and duration?

@adelacvg
Copy link
Owner

adelacvg commented Aug 2, 2023

The error may be caused by false alignment. Please check the textgrid file that "sp", "spn", "sil" are not empty or "". Duration and spec length should be matched so that model can converge.

@lileishitou
Copy link
Author

I found some textGrid files does not have sil(sil, sp, spn) , but other files have . I used mfa tool and use the token "english_us_arpa english_us_arpa" as model. why the generated TextGrid files different?

@adelacvg
Copy link
Owner

adelacvg commented Aug 3, 2023

I have added a check for empty silent phones. Update to the latest code, and reprocessed the dataset to see if there are any remaining issues. Hope this can help you.

@lileishitou
Copy link
Author

tks, that helps a lot.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants