--func_annot #27
Comments
Hello Nick! The wiki page with Both PanPhlAn 3 and HUMAnN 3 should use the same UniRef90 collection, but HUMAnN covers everything, while the PanPhlAn annotation files provided with the pangenome are species-specific and often contain uncharacterized (or poorly characterized) proteins (details can be found in this preprint). The - Hope this could help you.
Thanks for the clarification.
That is UniRef 2019-01, correct?
So then the UniRef IDs in the PanPhlAn output should be a subset of all UniRef IDs. This doesn't really explain the low percentage of IDs that map to the HUMAnN3 mapping files.
That's good to know. What is the format?
You are right. This is more of a usage/docs question than a bug/issue. Do you also want bug reports on the bioBakery help forum?
Yes, both used ChocoPhlAn (our internal pipeline) based on UniRef 2019-01. It is indeed strange that a low percentage of PanPhlAn UniRef90 IDs map; I'll check that whenever I find the time. Have you tried mapping UniRef50 instead?
Yes, the best would be bug reports and code-related issues on GitHub, and usage or general discussions on the forum, where there should be more people interacting and checking it. On top of that, it will be more convenient when questions concern several software tools at the same time.
Any updates on this? Were you able to reproduce the low UniRef90 ID mapping rate?
Where are the docs on using UniRef50 instead of UniRef90? In the wiki, I only see info on using UniRef90.
Hello Nick, sorry, I've been busy with other projects in the past month and I haven't checked that yet.
The wiki page on profiling shows the output as:
...but I get UniRef90 IDs for each pangenome instead of g[0-9]{5} (panphlan 3.1).
Which version of UniRef90 are the IDs from? I tried using map_eggnog_uniref90.txt.gz from the HUMAnN3 utility mapping file collection (UniRef 201901), and <5% of my PanPhlAn output UniRef IDs overlap with any IDs in the mapping file, suggesting that the PanPhlAn UniRef IDs are from a different (older?) version of UniRef.
I didn't see anything in the wiki about which (bioBakery) files are actually available to use with --func_annot. Can I use the HUMAnN3 utility mapping files?
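For anyone who wants to reproduce the overlap check described above, here is a minimal sketch of how it could be done. It is not part of PanPhlAn or HUMAnN. It assumes that UniRef90 IDs appear in both files as tokens of the form UniRef90_<accession>, and it simply collects every such token wherever it occurs on a line, so it does not depend on a particular column layout. The function names and the regular expression are my own and hypothetical.

```python
import gzip
import re

# Assumption: UniRef90 cluster IDs look like "UniRef90_<alphanumeric accession>".
UNIREF90_RE = re.compile(r"UniRef90_[A-Za-z0-9]+")


def open_text(path):
    """Open a plain or gzip-compressed text file for reading."""
    if str(path).endswith(".gz"):
        return gzip.open(path, "rt")
    return open(path, "r")


def collect_uniref_ids(path):
    """Return the set of UniRef90 IDs found anywhere in the file."""
    ids = set()
    with open_text(path) as handle:
        for line in handle:
            ids.update(UNIREF90_RE.findall(line))
    return ids


def overlap_fraction(query_ids, reference_ids):
    """Fraction of query IDs that also occur in the reference set."""
    if not query_ids:
        return 0.0
    return len(query_ids & reference_ids) / len(query_ids)
```

Calling collect_uniref_ids on the PanPhlAn profile and on map_eggnog_uniref90.txt.gz, then passing the two sets to overlap_fraction, gives the fraction of PanPhlAn IDs present in the mapping file. A low value only shows that the IDs do not match; it does not by itself say which UniRef release either file came from.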