5' adapter not trimmed if read ends early
Given a read with a 5’ adapter, e.g. ALONGADAPTORsequence, if the read is low quality at its 3’ end or ends in polyG, cutadapt will trim it to ALONGADAPTORseq (1st case) or ALONGADAP (2nd case). The -g
argument then removes the adapter in the 1st case, but not in the 2nd case. This causes adapter contamination in the filtered reads.
_Originally posted by @yech1990 in https://github.com/marcelm/cutadapt/issues/550#issuecomment-921100958_
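To make the two cases concrete, here is a minimal Python sketch of the behaviour described above. It is a toy model, not cutadapt's actual alignment code: it assumes exact matching (no errors or indels), accepts a partial adapter only at the 5’ end of the read, and uses a minimum overlap of 3 to mirror cutadapt's default. The function names `find_front_adapter` and `trim_front` are hypothetical and exist only for this illustration.

```python
from typing import Optional


def find_front_adapter(read: str, adapter: str, min_overlap: int = 3) -> Optional[int]:
    """Toy exact-match search for a regular (non-anchored) 5' adapter.

    Returns the index in `read` where the adapter match ends, or None.
    This is an illustrative assumption, not cutadapt's real algorithm.
    """
    # Full occurrence anywhere in the read.
    pos = read.find(adapter)
    if pos != -1:
        return pos + len(adapter)
    # Partial occurrence at the 5' end only: a suffix of the adapter
    # must be a prefix of the read. Try the longest suffix first.
    for k in range(len(adapter) - 1, min_overlap - 1, -1):
        if read.startswith(adapter[-k:]):
            return k
    # A prefix of the adapter at the 3' end of the read is not accepted,
    # so a read that ends inside the adapter yields no match.
    return None


def trim_front(read: str, adapter: str) -> str:
    """Remove the adapter and everything before it, if a match is found."""
    end = find_front_adapter(read, adapter)
    return read if end is None else read[end:]


adapter = "ALONGADAPTOR"
for read in ("ALONGADAPTORsequence", "ALONGADAPTORseq", "ALONGADAP"):
    print(read, "->", trim_front(read, adapter))
# ALONGADAPTORsequence -> sequence
# ALONGADAPTORseq -> seq
# ALONGADAP -> ALONGADAP
```

In this model the read from the 2nd case comes out unchanged because it contains only a prefix of the adapter, which is the contamination the issue reports.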
Issue Analytics
- State:
- Created 2 years ago
- Reactions:2
- Comments:6 (3 by maintainers)
Top GitHub Comments
Thank you very much. I have now found this argument in the documentation. 😂
On Fri, Oct 1, 2021, 04:14 Marcel Martin @.***> wrote:
Thanks for reporting, this is indeed a problem that should be fixed.
Some notes to myself and/or those interested in the details.
[...] `--no-indels`, which uses a different algorithm.

`ADAPTER` will be found in `PTERSEQUENCE`, but not in `SEQUENCEADAPT`. This is necessary because otherwise short, random matches at the 3’ end would lead to the entire read being trimmed. It is also symmetrical to how 3’ adapters are aligned. (For those, partial occurrences at the 5’ end are not allowed.)

One test case that should work is to find a regular 5’ adapter `ADAPTER`, allowing 1 error (ignoring error rate), within `TADAPT`. The alignment could look like this: