Bitextor testing output
Testing output files which differ from v4, has been generated using a commit very close to this commit.
Changes, apparently, has been caused due to:
- Removing default tokenizer from Bicleaner (now it is provided only if the user provides a tokenizer)
- Due to the different scores of Bicleaner, the number of sentences in some tests have been altered due to a configured threshold.
- Bicleaner AI submodule was updated, and scores might have been altered for this reason as well.
- Some output files have different order since the condition for sorting has been lightly changed (e.g.
run-deferred-tests.tgz
).
Update (after the release was published):
- Tests 40 and 50 have been enabled again: bitextor/bicleaner#72
- Test 40.1 was failing, what led to think that, specifically, hunalign was returning non-deterministic values depending on the machine that the tests were executed. Actually, we didn't notice that a different dictionary was being used, which was the reason why there were different values. The real reason why different results were being obtained was that in GHA, the tests are executed concurrently and in separate machines, while locally all the tests were being executed concurrently but in the same machine. This situation caused that, locally, the dictionary was being replaced. Fix: bitextor/bitextor@2a69167
- Older tests had been uploaded for
run-tests.tgz
file. It's been fixed.