-
Notifications
You must be signed in to change notification settings - Fork 308
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Auto accents for 2015-2019 #376
Conversation
…checking changes. Some changes were made to auto_authors to reduce the need for manual checking in the future. NAACL 2019 was not included.
NAACL 2019 will be done shortly, so please don't approve just yet. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I looked this over and checked a few random entries, but couldn’t turn up any errors. This is really cool.
Do you want to update the MOTD with this PR? It’d be nice to have some mention of this in the landing page banner. |
I don’t think it merits an announcement, especially since not all years are done yet and I am still figuring out how to do spelling changes. |
Auto accents for 2015-2019
This is the run of auto_authors.py on conferences 2015-2019 but excluding NAACL 2019. Most of the changes involve restoration of accents that appear in the PDF but not the metadata, but there are also capitalization, hyphenation, and spacing changes.
There are no actual spelling changes; for example, if the PDF says Kathleen but the metadata says Kathy, the script prints a warning but does not change anything.