Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is there a way to run subset-bam faster #34

Open
mattli7 opened this issue Aug 7, 2023 · 1 comment
Open

Is there a way to run subset-bam faster #34

mattli7 opened this issue Aug 7, 2023 · 1 comment

Comments

@mattli7
Copy link

mattli7 commented Aug 7, 2023

Hello,

I am trying to subset my bam file for each barcode. I have around 20k cells and each of their barcode is in a directory. I have been using the code below to execute subset-bam. It takes around 35 minutes per barcode. I was wondering if there is a way to make subset-bam run any faster, perhaps parallelization?

FILES="my directory containing every barcode"
for file in $FILES
do
filename=$(basename "$file")
filename_no_extension="${filename%%.*}"

subset-bam_linux --bam marked.duplicates.bam --bam-tag CB --cell-barcodes barcodes/$filename --out-bam barcode_bams/$filename_no_extension.bam
done

@ghuls
Copy link

ghuls commented Jan 30, 2024

See: #17 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants