Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pull data from "SARS-CoV-2 Sequence Data from Germany" #331

Closed
joverlee521 opened this issue Jul 29, 2022 · 3 comments
Closed

Pull data from "SARS-CoV-2 Sequence Data from Germany" #331

joverlee521 opened this issue Jul 29, 2022 · 3 comments
Labels
enhancement New feature or request

Comments

@joverlee521
Copy link
Contributor

Context

Similar to #329, there has been a significant drop off in sequences from Germany in the NCBI data since ~April 2022 (this issue was originally raised by @corneliusroemer in Slack):

Screen Shot 2022-07-29 at 10 40 29 AM

Description

We can update the open pipeline to pull metadata and sequences directly from the "SARS-CoV-2 Sequence Data from Germany" GitHub repo.

Possible solution

Similar solutions from #329 will apply here.

@joverlee521
Copy link
Contributor Author

I think the solution for directly pulling RKI sequences will be similar to my ideas for the COG-UK data.

The different step here might be how to remove the RKI sequences from the GenBank data. I have not found an accession linkage file for the RKI data. However, we can do a blanket removal of all sequences linked to the RKI BioProject.

@corneliusroemer
Copy link
Member

We could simply remove all German sequences uploaded to Genbank from March 2022 onwards and only spike from Germany's repo from that date onwards as a quick fix.
This would be 80/20, more effort may not be worth it.

@joverlee521
Copy link
Contributor Author

Resolved by #365

Repository owner moved this from Backlog to Done in Nextstrain planning (archived) Dec 13, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
No open projects
Development

No branches or pull requests

2 participants