-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Archive additional metadata and other information #121
Archive additional metadata and other information #121
Conversation
Also, let me know if this should go here or in the regional-mom6-xml repo |
@andrew-c-ross, thank you for adding the extra metadata to the archive. I will review it more closely later, but at first glance, I didn’t notice any major issues. I believe it would be beneficial to apply these changes to our internal repository as well, as it would allow other developing regions to benefit from the extra metadata in the archives. |
Thanks @andrew-c-ross @yichengt900! I will have a global attribute of |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@andrew-c-ross, the section on metadata and additional information looks good to me. I think it covers almost everything. I only found a few small typos. I will leave this PR open a little longer so others have time to review and provide any feedback.
Thanks, I should spellcheck before I copy and paste 🤦 |
I like this idea. Saving the commit hash for each module seems especially helpful for reproducing old simulations since defaults can change over time. I think moving the parameter doc iles into the meta data folder also makes sense. Should we use this as an opportunity to rename Is the |
Looking into this, I wonder if we even need the extra.results or if it is a remnant of a very old version of FRE. This is the regex FRE uses to pick up ascii files:
It seems like this could catch the stats and velocity truncation files.
My thinking was we are archiving the XML now, and currently we always include the full field table in the XML rather than linking a file. But the actual field table file could easily be added if we wanted it. |
If the field table is in the xml that should be good enough. I would suggest we test if |
Good catch, @andrew-c-ross. I can confirm that the new FRE now willl archive stats files as well as velocity_truncation files (if you name them like U.velocity_truncations). |
I think they are currently named like U_velocity_truncations. Does that not get caught? It looks like the name can be changed with the |
@andrew-c-ross, yes, we can always change file names in the MOM_input options. The issue is that the new FRE treats "velocity_truncation" as a file extension, so unfortunately, it cannot recognize |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@andrew-c-ross, I have tested the new changes and can confirm that they successfully archive the metadata and additional information as proposed. Approved. I'll wait a bit to see if there is any further feedback. Thank you again for this contribution!
I will merge this PR for now. If any metadata-related issues arise or concerns are raised later, we can always revisit them. We'll also add these changes to our private XML repository later. |
@charliestock and I have been talking with the data portal team about metadata to include in the latest files that will be served on the CEFI data portal. We largely settled on adding an attribute to the published netcdf files pointing to the path of the data on /archive, and then storing additional metadata and other information in /archive. This PR adds some of that extra metadata to a new
metadata.out/
directory that gets included in the ascii tar file. The main additions here are archiving the model XML file itself and the commit hash for each git submodule in the source code.The new
git_submodule_status
file that gets archived will look something like this; it is the result of runninggit submodule status --recursive
:This PR could also be a good place to discuss if there are other files that should be added to the archive.