-
Notifications
You must be signed in to change notification settings - Fork 104
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Some trouble with the FastQC report #667
Comments
RRBS data is weird, as by definition you are only sequencing a very small subset of the genome (hence: reduced representation). Depending on the specific protocol and genome there are only a few hundred thousand possible fragments you expect to sequence, and you've got > 30 million reads. So naturally, you will sequence the same fragments several times, and evidently some of them are highly over-represented. This isn't really something you can do much about, (maybe with the exception of deduplicating using UMIs), but it just comes with the method. The same also goes for the base composition, it is expected. The only thing that needs (hard-)trimming are the filled-in bases from the end-repair reaction. Is this by any chance the Diagenode v2 kit by any chance? |
I am sorry to tell you that I know little about this, because I paid for company to do this experiment. I will ask the company for more details. thank you for your kind reply! |
If it happens to be the Diagenode v2 RRBS kit, there was recently a discussion as well as some processing tips here: FelixKrueger/TrimGalore#177 (comment) |
Hi Felix,
I am sorry to trouble you. I’m having some trouble with the FastQC report and would like to ask you.
I had input code like this:
trim_galore -q 20 --phred33 --stringency 3 --length 20 -e 0.1 --paired A61.1.fq.gz A61.2.fq.gz -o /export/home/***
fastqc -o /export/home/limiao29/RRBS/Lung/fastqc -t 12 /export/home/limiao29/RRBS/Lung/*.fq.gz
The text was updated successfully, but these errors were encountered: