question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

smart-seq2 stranded

See original GitHub issue

Hi Developers,

Below is an excerpt of logic SmartSeq2:

class SmartSeq2(Logic):
    """SmartSeq2 logic
    """

    def __init__(self) -> None:
        self.name = "SmartSeq2"

    @property
    def layers(self) -> List[str]:  # This should be overridden if a different set of layers is desired
        return ["spliced", "unspliced", "ambiguous", "spanning"]

    @property
    def stranded(self) -> bool:
        return False

I wonder if it true that SmartSeq2 will treat reads as non-stranded, even for stranded data? Thanks.

Issue Analytics

  • State:closed
  • Created 5 years ago
  • Comments:6 (3 by maintainers)

github_iconTop GitHub Comments

1reaction
luyoutaocommented, Jun 9, 2018

Thanks to your answer. That explains my observation that, when I tried to feed velocyto (with SmartSeq2 logic) a bam file with only R1 or R2, I got half number of counts as the entire bam file.

This behavior seems unexpected to me, because mainstream paired-end quantification tools avoid such double counting scheme. Here is an example from HTSeq document (http://htseq.readthedocs.io/en/master/count.html):

For paired-end data, does htseq-count count reads or read pairs? Read pairs. The script is designed to count “units of evidence” for gene expression. If both mates map to the same gene, this still only shows that one cDNA fragment originated from that gene. Hence, it should be counted only once.

0reactions
luyoutaocommented, Jun 12, 2018

Thanks again for your confirmation.

Sure. Bam file has read pair (among others) information encoded in “flag” (https://broadinstitute.github.io/picard/explain-flags.html): 0x40 indicates R1 and 0x80 R2. In terms of paired-end quantification logic, there are two widely used packages HTSeq (https://github.com/simon-anders/htseq/tree/master) and FeatureCount (http://subread.sourceforge.net/) you might refer to. Essentially, both package assigns a count to a gene, if one or both mates are aligned.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Smart-Seq2 - Illumina
mRNA sequence does not have to be known; Improved coverage across transcripts; High level of mappable reads. Cons: Not strand-specific; No early multiplexing...
Read more >
Smart-Seq2 - Enseqlopedia
After the first-strand reaction, the cDNA is amplified using a limited number of cycles. Next, tagmentation is used to construct sequencing ...
Read more >
Smart-seq2 Single Sample Overview | WARP - GitHub Pages
The pipeline has been validated to support human and mouse, stranded or unstranded, paired- or single-end, and plate- or fluidigm-based Smart-seq2 datasets.
Read more >
Description of Smart-seq2 scRNA Sequencing
Overview of Smart-seq2 scRNA sequencing in the HCA DCP. ... able to switch templates and synthesize the complementary cDNA strand using the TSO...
Read more >
SMARTer single cell total RNA sequencing - Oxford Academic
Here, we developed a novel single cell strand-specific total RNA library ... Smart-seq2 generates read coverage across the whole transcript expanding the ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found