-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Database Download & Documentation #141
Comments
If you are calling |
@luispedro I'm trying to integrate SemiBin2 into my VEBA (https://github.com/jolespin/veba) metagenomics software suite for the prokaryotic binning module. A few questions:
|
|
@jolespin : is |
I did some test runs and it's not in the newest version. I was mostly going through all the parameters before my first run to make sure I was doing it correctly. Do you recommend a way I can precomputed abundance tables for 1 sample and for <5 samples that will be compatible with semibin2? I typically use CoverM but open to other methods. |
Gday @psj1997 @luispedro and other Semibin developers,
Firstly thanks for Semibin(2) - it works amazingly well, so many bins recovered compared to other binning methods :)
I want to share some feedback regarding database download and Semibin's documentation.
The HPC cluster I use at my institution blocks internet access on compute nodes. Therefore, lazily downloading the Semibin2 database did not work when I ran the below command (Semibin v1.5.1, Linux installation via bioconda).
It was difficult for me to figure out that this was in fact the error, because a database isn't mentioned in the readme and only in the FAQs of the docs, and the error message wasn't informative (apologies I have overwritten the log file or I would quote it).
I then tried following the FAQs of the docs to download the updated GTDB database, the following does not work in MMseqs2 v13.45111 (with this known MMSeqs2 error soedinglab/MMseqs2#561)
Then, after looking at the Semibin codebase I was able to install the database manually:
and went from there, specifying
-r {params.db}
and then semibin worked perfectly.So perhaps either including a specific
--download_database
flag or script, or just documenting a manual install method would help future users like me without compute node internet access.George
The text was updated successfully, but these errors were encountered: