Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New Terms - projectTitle; projectID; fundingBodyName; fundingBodyID #527

Closed
aaltenburger2 opened this issue Oct 28, 2024 · 6 comments
Closed

Comments

@aaltenburger2
Copy link

aaltenburger2 commented Oct 28, 2024

New terms

Submitter: Andreas Altenburger (GBIF Norway)

Efficacy Justification: I work at a university museum that publishes its collections on GBIF as datasets. We constantly receive requests from contributors to the museum collection, asking to be able to track "their" contributions at the record level on GBIF. This relates to private funders such as Ocean Census (https://oceancensus.org/) or the Mohn Foundation (https://mohnfoundation.no/), governmental funding from Artsdatabanken or the Research Council of Norway, and institutional internal funding. We need to be able to attribute the specimens to their respective projects and funders.

Demand Justification: Record-level attribution has been requested several times previously. See discussions
tdwg/dwc-qa#37
tdwg/dwc-qa#83
tdwg/dwc-qa#100
gbif/pipelines#836
gbif/ipt#1780
for more details.

Stability Justification: New terms for record-level attribution are unlikely to negatively impact existing implementations because these terms would be additional, optional fields that enhance the granularity of data attribution without altering existing data structures. Current users and systems can continue to operate without adopting these new terms immediately, allowing for a gradual integration. Moreover, these terms are designed to be backward-compatible, ensuring that they do not disrupt existing workflows or data integrity. The addition of these terms would primarily provide a means for more detailed tracking and reporting, which is a growing requirement from funders and contributors. This enhancement would improve the transparency and traceability of data contributions without imposing changes on those who do not require this level of detail.

Implications for dwciri: The introduction of the proposed new terms - projectTitle, projectID, fundingBodyName, and fundingBodyID - does not necessitate changes to existing dwciri term versions. The new terms would be added as properties within the Darwin Core namespace but would not alter the definitions or functionalities of existing dwciri terms. They are designed to be complementary and to integrate seamlessly with the current structure, ensuring that they do not disrupt existing implementations These additions would simply expand the capability of the Darwin Core standard to convey more detailed project and funding information, which is increasingly important for data transparency and traceability in biodiversity research.

Proposed attributes of the new term:

Term names: projectTitle; projectID; fundingBodyName; fundingBodyID
Term labels: Project Title; Project ID, Funding Body Name; Funding Body ID
Organized in Class: Occurence
Definition of the terms (normative):

projectTitle: The title or name of the project under which the data was collected or the specimen was acquired.

projectID: A list (concatenated and separated) of unique identifiers for the project(s) that contributed to the original dwc:Occurrence. The projectID can link multiple occurrence records associated with the same project but may be shared in
different datasets. The nature of the association can be described in the metadata project description element.

fundingBodyName: The name of the organization or agency that provided funding for the project.

fundingBodyID: A unique identifier for the funding organization or agency that supported the project.

Usage comments:
projectTitle: Use this term to provide the official or commonly recognized title or name of the project. This should be the title under which the project is known and cited. Avoid abbreviations unless they are widely understood. The recommended best practice is to separate the values in a list with space vertical bar space ( | ).
projectID: This term should be used to provide a globally unique identifier (GUID) for the project, if available. This could be a DOI, URI, or any other persistent identifier that ensures the project can be uniquely distinguished from others. The recommended best practice is to separate the values in a list with space vertical bar space ( | ).
fundingBodyName: Specify the full official name of the funding body. This should include the complete name without abbreviations, unless the abbreviation is an official and commonly recognized form (e.g., NSF for the National Science Foundation). The recommended best practice is to separate the values in a list with space vertical bar space ( | ).
fundingBodyID: Provide a unique identifier for the funding body, such as an identifier used in governmental or international databases. If no official identifier exists, use a persistent and unique identifier within your organization or dataset. The recommended best practice is to separate the values in a list with space vertical bar space ( | ).

Examples:
projectTitle: The Nansen Legacy; Scalidophora i Noreg; Arctic Deep
projectID: RCN276730; Artsproject_7-24; OC202405
fundingBodyName: Norges forskningsråd; Artsdatabanken; Ocean Census | Nippon Foundation
fundingBodyID: https://ror.org/00epmv149; https://ror.org/04jnzhb65; https://ror.org/05wszs827

Refines: NA

Replaces: NA

ABCD 2.06: Here I am not sure and require support from the community
projectTitle: /DataSets/DataSet/Units/Unit/Gathering/Project/ProjectTitle
projectID: /DataSets/DataSet/Units/Unit/Gathering/Project/
fundingBodyName: /DataSets/DataSet/Metadata/Owners/Owner/Organisation/Name this is not a perfect fit as the funding body is not necessarily the owner
fundingBodyID: /DataSets/DataSet/Metadata/Owners/Owner/Organisation/ID this is not a perfect fit as the funding body is not necessarily the owner

@CecSve
Copy link

CecSve commented Oct 28, 2024

The GBIF Secretariat supports the addition of the terms, and especially projectID is urgently needed.

Could the proposal be updated with the following changes?:

Organized in Class: Occurrence

Definition of the terms (normative):

projectID: A list (concatenated and separated) of unique identifiers for the project(s) that contributed to the original dwc:Occurrence. The projectID can link multiple occurrence records associated with the same project but may be shared in different datasets. The nature of the association can be described in the metadata project description element.

Add to Usage comments: The recommended best practice is to separate the values in a list with space vertical bar space ( | ).

In general, should all fields not be defined as multivalue, as the examples also show? A project may be funded by multiple sources etc.

@aaltenburger2
Copy link
Author

aaltenburger2 commented Oct 28, 2024

Updated as suggested.

@CecSve
Copy link

CecSve commented Oct 29, 2024

Funded GBIF projects are currently mandated to share a projectTitle, based on the EML element project/title. Would it be possible to change the term from projectName to projectTitle instead?

@aaltenburger2 aaltenburger2 changed the title New Terms - projectName; projectID; fundingBodyName; fundingBodyID New Terms - projectTitle; projectID; fundingBodyName; fundingBodyID Oct 29, 2024
@aaltenburger2
Copy link
Author

Updated

@tucotuco
Copy link
Member

I have labeled this issue with "Process - need templated change request" to facilitate the discussion and management of the proposed changes. I would appreciate it if the interested parties would could separate these before submitting them for public review.

@aaltenburger2
Copy link
Author

aaltenburger2 commented Dec 18, 2024

Done. Please see
#531 for projectTitle
#532 for projectID
#533 for fundingBodyName
#534 for fundingBodyID

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants