Is there a way to run subset-bam faster #34

mattli7 · 2023-08-07T23:31:31Z

Hello,

I am trying to subset my BAM file by barcode. I have around 20k cells, and each cell's barcode is stored in its own file in a directory. I have been using the code below to run subset-bam, and it takes around 35 minutes per barcode. Is there a way to make subset-bam run faster, perhaps through parallelization?

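# Placeholder: path to the directory that holds one barcode file per cell
FILES="my directory containing every barcode"
# Loop over the files inside the directory, not over the directory name itself
for file in "$FILES"/*
do
    filename=$(basename "$file")
    # Strip everything from the first dot onward to get the output name
    filename_no_extension="${filename%%.*}"

    subset-bam_linux --bam marked.duplicates.bam --bam-tag CB --cell-barcodes "$file" --out-bam "barcode_bams/$filename_no_extension.bam"
done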

ghuls · 2024-01-30T15:02:42Z

See: #17 (comment)
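
For the parallelization part of the question, here is a minimal sketch that runs several subset-bam jobs at once with `xargs -P`. It is not taken from the linked comment. The function name `subset_all` and the variables `SUBSET_BAM`, `BAM`, `BARCODE_DIR`, `OUT_DIR` and `JOBS` are hypothetical names chosen for illustration; the subset-bam flags are the ones already used in the loop above. Each job still reads the whole input BAM, so the speedup is limited by available cores and disk throughput.

```shell
#!/usr/bin/env bash
# Sketch only: run one subset-bam job per barcode file, several at a time.
# All variable names below are illustrative assumptions, with defaults
# matching the paths used in the original loop.

subset_all() {
    local subset_bam="${SUBSET_BAM:-subset-bam_linux}"
    local bam="${BAM:-marked.duplicates.bam}"
    local barcode_dir="${BARCODE_DIR:-barcodes}"
    local out_dir="${OUT_DIR:-barcode_bams}"
    local jobs="${JOBS:-4}"   # number of concurrent subset-bam processes

    mkdir -p "$out_dir"

    # List barcode files NUL-delimited so unusual filenames are handled,
    # then let xargs start up to $jobs processes, one file per process.
    # Inside sh -c: $1=binary, $2=input BAM, $3=output dir, $4=barcode file.
    find "$barcode_dir" -maxdepth 1 -type f -print0 |
        xargs -0 -n 1 -P "$jobs" sh -c '
            name=$(basename "$4")
            "$1" --bam "$2" --bam-tag CB \
                 --cell-barcodes "$4" \
                 --out-bam "$3/${name%%.*}.bam"
        ' sh "$subset_bam" "$bam" "$out_dir"
}
```

After sourcing the function, something like `JOBS=8 subset_all` would run eight jobs at a time. It is worth starting with a small `JOBS` value and watching memory and I/O before raising it.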
