Indersci urlid.20241127 #152

Merged
merged 5 commits into from
Dec 10, 2024

seasidesparrow
Member

This PR fixes a few edge cases:

  • Records from InderSci do not provide the DOI in its own field, but the DOI is always part of the publisher URL, so the parser now takes the DOI from the URL.
  • Records from the Geological Society of London may have affiliations in a separate contrib block without including the xids in the author definitions. Assuming the author list and the xref dictionary have the same length (and that xref is an ordered dict), and after checking that the author aff and xref fields are empty, add the xids to each author's xref key-value pair.
  • Minor formatting tweaks
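
The two edge cases above can be sketched roughly as follows. This is a minimal illustration, not the code in adsingestp/parsers/jats.py: the function names `doi_from_url` and `assign_unkeyed_xids`, the author keys `"aff"` and `"xref"`, the DOI regular expression, and the example URL are all assumptions made for the sketch.

```python
import re
from typing import Dict, List, Optional

# Assumed DOI shape: "10.", a 4-9 digit registrant code, "/", then a suffix
# that runs until whitespace or a URL delimiter. Real DOIs can be looser.
DOI_PATTERN = re.compile(r"10\.\d{4,9}/[^\s?#&]+")


def doi_from_url(url: str) -> Optional[str]:
    """Return the first DOI-like substring in a URL, or None if absent."""
    match = DOI_PATTERN.search(url)
    return match.group(0) if match else None


def assign_unkeyed_xids(authors: List[dict], xref_dict: Dict[str, str]) -> List[dict]:
    """Pair authors with xids by position when no author carries its own.

    Only acts if the author list and the xid dictionary have the same
    length and every author's "aff" and "xref" fields are empty; otherwise
    the authors are returned unchanged. Relies on dict insertion order.
    """
    if len(authors) != len(xref_dict):
        return authors
    if any(a.get("aff") or a.get("xref") for a in authors):
        return authors
    for author, xid in zip(authors, xref_dict):
        author["xref"] = [xid]
    return authors


# Hypothetical inputs, for illustration only.
url = "https://example.org/doi/10.1504/EXAMPLE.2024.000001"
doi = doi_from_url(url)

authors = [{"name": "Author One"}, {"name": "Author Two"}]
xrefs = {"aff1": "First Institution", "aff2": "Second Institution"}
paired = assign_unkeyed_xids(authors, xrefs)
```

Positional pairing is only safe under the stated assumptions, which is why the sketch leaves the authors untouched whenever the lengths differ or any author already has affiliation data.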

 	modified:   adsingestp/parsers/jats.py
 	new file:   tests/stubdata/input/jats_gsl_unkeyed_xref.xml
 	new file:   tests/stubdata/input/jats_indersci_url_ident.xml
 	new file:   tests/stubdata/output/jats_gsl_unkeyed_xref.json
 	new file:   tests/stubdata/output/jats_indersci_url_ident.json
 	modified:   tests/test_jats.py
@codecov-commenter
Codecov Report

Attention: Patch coverage is 88.00000% with 3 lines in your changes missing coverage. Please review.

Project coverage is 91.38%. Comparing base (8bb2152) to head (3e17296).

Files with missing lines Patch % Lines
adsingestp/parsers/jats.py 88.00% 3 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #152      +/-   ##
==========================================
- Coverage   91.39%   91.38%   -0.02%     
==========================================
  Files          25       25              
  Lines        2940     2959      +19     
==========================================
+ Hits         2687     2704      +17     
- Misses        253      255       +2     


Contributor

@mugdhapolimera mugdhapolimera left a comment

Looks good to me. Feel free to merge at your convenience.

@seasidesparrow seasidesparrow merged commit 8f3573e into adsabs:main Dec 10, 2024
4 checks passed
@seasidesparrow seasidesparrow deleted the indersci_urlid.20241127 branch December 10, 2024 13:08