udapi-markbugs: fix multi-obj #34
Comments
In p05_017 (dev) both Mr Macaulay and £4/11 are tagged
This is annoying because you can remove either of them and the sentence still makes sense. Looking through the examples in DASG, the amount paid is occasionally marked with le, what is being paid for is marked with air or airson, and sometimes the payee is marked with do. I wonder whether the best thing to do is therefore to say that the payee is an unmarked |
…l having been mistagged as mark:prt + ccomp rather than mark + advcl. Also fixed erroneous use of the word 'token' in not-to-release/validate_gd_extras.py.
…ng 'if') from mark:prt to mark, addressing UniversalDependencies/UD_Scottish_Gaelic-ARCOSG#34
At first glance there appear to be some genuine multiple-object cases, where a verbal noun is preceded by the object and also by the aspect marker ag combined with a personal possessive pronoun to form gam, gad, ga or gan, e.g. ns08_024 (dev)
This is a bit like the problem discussed here: UniversalDependencies/UD_Welsh-CCG#3 but not exactly.
Examples here: https://leacan.gla.ac.uk/leacan/?gd=ag show that this can be thought of as a passive-like construction:
In the case above I'm treating ionnsaigh ga thoirt air boireannach a raoir as being an
There are a few cases where the infinitive particle (a + lenition) has been tagged in the ARCOSG corpus as the possessive (also a + lenition). This has led to the multiple-object error because the possessive pronoun before a verbal noun is automatically tagged as the object, as it is in some constructions. They have been retagged. In one sentence:
the original text has an (the possessive plural pronoun, which does not lenite) in place of a + lenition. The |
…ng 'if') from mark:prt to mark, addressing #34
Not sure why this hasn't been caught by the other validation scripts...