dataset.py: codes.size(-1)=48960, sum(duration)=38678 assert error #15

lileishitou · 2023-08-02T08:23:19Z

The check in dataset.py:

assert abs(codes.size(-1) - sum(duration)) < 3, (codes.size(-1), sum(duration), filename)
assert abs(audio.shape[1]-lmin * self.hop_length) < 3 * self.hop_length

Why does it check the encoded length against the summed durations?

adelacvg · 2023-08-02T11:46:39Z

The error is probably caused by a bad alignment. Please check that the silence intervals in the TextGrid file are labeled "sp", "spn", or "sil" and are not empty (""). The durations and the spec length must match for the model to converge.
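To illustrate why the totals can drift apart, here is a minimal sketch, not the repository's dataset.py. It assumes intervals are plain (start_sec, end_sec, label) tuples, and the SAMPLE_RATE and HOP_LENGTH values and the frames_for helper are hypothetical. If intervals with an empty label are skipped, their frames disappear from sum(duration) while the codes still cover the whole audio.

```python
# Hypothetical illustration; names and values are assumptions, not repo code.
SAMPLE_RATE = 16000   # assumed sampling rate in Hz
HOP_LENGTH = 320      # assumed samples per feature frame


def frames_for(intervals, skip_empty):
    """Sum per-phone frame counts from (start_sec, end_sec, label) tuples."""
    fps = SAMPLE_RATE / HOP_LENGTH
    total = 0
    for start, end, label in intervals:
        if skip_empty and label == "":
            continue  # silence dropped: its frames vanish from the total
        # Round the boundaries, not the lengths, so rounding errors do not pile up.
        total += round(end * fps) - round(start * fps)
    return total


# Two seconds of audio with unlabeled silence at both ends.
intervals = [(0.0, 0.5, ""), (0.5, 0.9, "HH"), (0.9, 1.4, "AH0"), (1.4, 2.0, "")]
n_frames = round(2.0 * SAMPLE_RATE / HOP_LENGTH)  # frames the encoder would emit

kept_all = frames_for(intervals, skip_empty=False)
dropped = frames_for(intervals, skip_empty=True)
print(n_frames, kept_all, dropped)
```

With the silences kept, the total equals the frame count; with them dropped, the gap is far larger than the tolerance of 3 used in the assertion.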

lileishitou · 2023-08-03T04:06:23Z

I found that some TextGrid files do not contain any silence labels (sil, sp, spn), while other files do. I ran the MFA tool with "english_us_arpa english_us_arpa" as the model. Why do the generated TextGrid files differ?

adelacvg · 2023-08-03T12:38:00Z

I have added a check for empty silent phones. Please update to the latest code and reprocess the dataset to see whether any issues remain. I hope this helps.
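One way such a check could look is sketched below. This is an assumption about the general approach, not the actual change in the repository: normalize_silence and to_durations are hypothetical helpers, intervals are plain (start_sec, end_sec, label) tuples, and "sp" as the replacement token is an arbitrary choice.

```python
# Hypothetical sketch; helper names and the "sp" token are assumptions.
def normalize_silence(intervals, silence_token="sp"):
    """Relabel empty or whitespace-only intervals as an explicit silence phone."""
    normalized = []
    for start, end, label in intervals:
        label = label.strip()
        if label == "":
            label = silence_token
        normalized.append((start, end, label))
    return normalized


def to_durations(intervals, frames_per_second):
    """Convert intervals to parallel lists of phones and frame durations."""
    phones, durations = [], []
    for start, end, label in intervals:
        phones.append(label)
        durations.append(round(end * frames_per_second) - round(start * frames_per_second))
    return phones, durations


raw = [(0.0, 0.5, ""), (0.5, 0.9, "HH"), (0.9, 1.4, "AH0"), (1.4, 2.0, " ")]
phones, durations = to_durations(normalize_silence(raw), frames_per_second=50)

n_code_frames = 100  # stand-in for codes.size(-1) on two seconds of audio
assert abs(n_code_frames - sum(durations)) < 3, (n_code_frames, sum(durations))
```

Because every interval now carries a phone label, the summed durations cover the full utterance and the same tolerance check as in dataset.py passes.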

lileishitou · 2023-08-04T06:02:25Z

Thanks, that helps a lot.
