Skip to content

Commit

Permalink
Update quick_start.rst to include download instructions for reads
Browse files Browse the repository at this point in the history
  • Loading branch information
hmusta authored Jan 24, 2025
1 parent e040d5a commit a1e49c0
Showing 1 changed file with 5 additions and 2 deletions.
7 changes: 5 additions & 2 deletions metagraph/docs/source/quick_start.rst
Original file line number Diff line number Diff line change
Expand Up @@ -92,9 +92,12 @@ input and deduplicate/count/filter the input k-mers.
For example, the following commands can be used to construct a graph only from k-mers
occurring at least 5 times in the input::

# download the input reads
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR403/SRR403017/SRR403017.fastq.gz

K=31
# count k-mers with KMC
./KMC/kmc -ci5 -t4 -k$K -m5 -fm SRR403017.fasta.gz SRR403017.cutoff_5 ./KMC
./KMC/kmc -ci5 -t4 -k$K -m5 -fq SRR403017.fastq.gz SRR403017.cutoff_5 ./KMC
# build graph with MetaGraph
metagraph build -v -p 4 -k $K -o graph SRR403017.cutoff_5.kmc_pre

Expand All @@ -121,7 +124,7 @@ with pre-counting (see :ref:`annotate_with_precounting`).
.. note::
A weighted graph can also be constructed directly from raw input sequences, without pre-counting with KMC, e.g.,::

metagraph build -v -p 4 -k 31 --count-kmers -o graph SRR403017.fasta.gz
metagraph build -v -p 4 -k 31 --count-kmers -o graph SRR403017.fastq.gz

This should be used when pre-processing with KMC is complicated or impossible, e.g., when indexing protein sequences.

Expand Down

0 comments on commit a1e49c0

Please sign in to comment.