Some trouble with the FastQC report #667

Citrusyh · 2024-04-29T14:27:23Z

Hi Felix,
I am sorry to trouble you. I’m having some trouble with the FastQC report and would like to ask you.

According to the “Per base sequence content”, should I clip the first 6 bp for good results? And this curve doesn’t look smooth.
According to the “Sequence Duplication Levels”, why is there only one line here? What’s wrong with my code?
There are so many overrepresented sequences, is it normal when dealing with RRBS data?
I had input code like this:
trim_galore
trim_galore -q 20 --phred33 --stringency 3 --length 20 -e 0.1 --paired A61.1.fq.gz A61.2.fq.gz -o /export/home/***
fastqc
fastqc -o /export/home/limiao29/RRBS/Lung/fastqc -t 12 /export/home/limiao29/RRBS/Lung/*.fq.gz

FelixKrueger · 2024-04-29T14:35:43Z

RRBS data is weird, as by definition you are only sequencing a very small subset of the genome (hence: reduced representation). Depending on the specific protocol and genome there are only a few hundred thousand possible fragments you expect to sequence, and you've got > 30 million reads. So naturally, you will sequence the same fragments several times, and evidently some of them are highly over-represented.

This isn't really something you can do much about, (maybe with the exception of deduplicating using UMIs), but it just comes with the method. The same also goes for the base composition, it is expected. The only thing that needs (hard-)trimming are the filled-in bases from the end-repair reaction. Is this by any chance the Diagenode v2 kit by any chance?

Citrusyh · 2024-04-29T14:49:19Z

I am sorry to tell you that I know little about this, because I paid for company to do this experiment. I will ask the company for more details. thank you for your kind reply!

FelixKrueger · 2024-04-29T15:01:04Z

If it happens to be the Diagenode v2 RRBS kit, there was recently a discussion as well as some processing tips here: FelixKrueger/TrimGalore#177 (comment)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some trouble with the FastQC report #667

Some trouble with the FastQC report #667

Citrusyh commented Apr 29, 2024

FelixKrueger commented Apr 29, 2024

Citrusyh commented Apr 29, 2024

FelixKrueger commented Apr 29, 2024

Some trouble with the FastQC report #667

Some trouble with the FastQC report #667

Comments

Citrusyh commented Apr 29, 2024

FelixKrueger commented Apr 29, 2024

Citrusyh commented Apr 29, 2024

FelixKrueger commented Apr 29, 2024