question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Problems with dedup output and RSEM

See original GitHub issue

Hi,

I’m using umi_tools dedup to remove PCR duplicates from an alignment to the transcriptome with STAR. After the deduplication, when I run RSEM, it seems that there are some reads from the pairs that are lost since the program exits with the following error: Read ST-E00114:1178:HFL75CCX2:7:1101:1610:55297_TTGCCATCTC: The adjacent two lines do not represent the two mates of a paired-end read! (RSEM assumes the two mates of a paired-end read should be adjacent)

The ran the command with --paired --multimapping-detection-method=NH --unpaired-reads=discard --chimeric-pairs=discard --unmapped_reads=discard

I have seen that this problem was already discussed in #384, but there is not an option on how to solve this. Do you have any idea on how to solve this issue or a workaround that could work for this case?

Thank you very much

Issue Analytics

  • State:open
  • Created 2 years ago
  • Comments:64 (28 by maintainers)

github_iconTop GitHub Comments

3reactions
IanSudberycommented, Jan 18, 2022

As for release schedule: I can’t see any reason not to include this script in the next release. We have a set of things we’d like to tidy up before the next release, but its probably not a million miles off.

2reactions
IanSudberycommented, Jul 16, 2021

Yeah, I thought that it might be coming from $options.args. There is still a potential theoretical problem with situations where read1 is alignmed but read2 isn’t, so I’ll see what I can think of for that.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Should We Remove Duplicated Reads In Rna-Seq ? - Biostar
The general consensus seems to be to NOT remove duplicates from RNA-seq. Read the biostar discussions: Duplicated reads in RNA-Seq Experiment ...
Read more >
Introduction - rnaseq » nf-core
RNA sequencing analysis pipeline using STAR, RSEM, HISAT2 or Salmon with gene/isoform ... This document describes the output produced by the pipeline.
Read more >
STAR manual 2.7.0a
One option to avoid this problem is to install gcc (preferably ... By default, the output satisfies RSEM requirements: soft-clipping or ...
Read more >
Documentation: MultiQC
Problems caused by this will typically be discovered be fewer results than ... This module search for the file .cnt created by RSEM...
Read more >
BBMerge Guide - DOE Joint Genome Institute
If you use an option such as extend, extend2, ecct, kfilter, rem, or rsem, ... can cause problems in assembly or clustering (leading...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found