Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

generateEvents misses splicing events

See original GitHub issue

Dear SUPPA developers,

We want to analyse data from Arabidopsis with your nice tool. The gene 'AT1G45474 have splicing events but they are not detected by SUPPA.

Do you have any idea why ?

Thanks a lot for your help.

Our call: python3 /bin/suppa/ generateEvents -i SUPPA.gtf -o generateEvents.gtf --event-type SE SS MX RI FL --exon-length 5 --boundary V --threshold 1

SUPPA.gtf ( only the rows for AT1G45474)

1	TAIR10	exon	17179302	17179537	0	+	.	gene_id "AT1G45474"; transcript_id "AT1G45474.2"
1	TAIR10	exon	17179610	17179742	0	+	.	gene_id "AT1G45474"; transcript_id "AT1G45474.2"
1	TAIR10	exon	17179828	17179940	0	+	.	gene_id "AT1G45474"; transcript_id "AT1G45474.2"
1	TAIR10	exon	17180021	17180217	0	+	.	gene_id "AT1G45474"; transcript_id "AT1G45474.2"
1	TAIR10	exon	17180297	17180439	0	+	.	gene_id "AT1G45474"; transcript_id "AT1G45474.2"
1	TAIR10	exon	17180701	17180806	0	+	.	gene_id "AT1G45474"; transcript_id "AT1G45474.2"
1	TAIR10	exon	17179302	17179537	0	+	.	gene_id "AT1G45474"; transcript_id "AT1G45474.1"
1	TAIR10	exon	17179610	17179742	0	+	.	gene_id "AT1G45474"; transcript_id "AT1G45474.1"
1	TAIR10	exon	17179828	17179940	0	+	.	gene_id "AT1G45474"; transcript_id "AT1G45474.1"
1	TAIR10	exon	17180021	17180217	0	+	.	gene_id "AT1G45474"; transcript_id "AT1G45474.1"
1	TAIR10	exon	17180297	17180530	0	+	.	gene_id "AT1G45474"; transcript_id "AT1G45474.1"

Issue Analytics

  • State:closed
  • Created 7 years ago
  • Comments:7 (4 by maintainers)

github_iconTop GitHub Comments

EduEyrascommented, Feb 8, 2017

Dear Claes,

thanks a lot for your email. Looking at the exon coordinates of your gene, I can see that transcript .1 has the same exons as transcript .2 except for the 3’ end exon, which is a bit longer. Also, the 3’ end exons from .2 is missing in .1. Is that correct?

This type of variation is not considered in the definitions by SUPPA. As the variable exon in .1 is a terminal exon, its end coordinate cannot be used as an alternative splice-site. Regarding a possible alternative polyA, it is true that is a perfectly possible variation but SUPPA only considers the case when the alternative 3’ end exon is different, but share a common upstream 5’ splice-site.

Let me know if that is what you meant. Thanks for using SUPPA


EduEyrascommented, Oct 19, 2017


On Thu, Oct 19, 2017 at 6:07 PM, Lilly88 wrote:

Dear Eduardo,

thank you for your reply. In my case I observe many more significant differentially used isoforms than events, just filtering for dPSI |0.1| and pval<=0.05 I find roughly 4 times more differentially used isoforms than events. Intuitively I would expect the opposite, but for what I understood the calculation of events is overly simplified, considering the usage of 1 exon in 1 isoform over its usage in the “closest” isoforms and not over all the isoforms with different first exons.

it can include all the others that conform to a definition of two alternatives. But in the case that alternative first exons it is true that it selects the isoform that gives the closest alternative exon.

There is a large number of alternative first exons (generic) and not all of them end up described as events in SUPPA. You probably get many more from isoforms for that reason. Also, they are generally very abundant genome-wide compared with other types of variations, at least in human.

For example, referring to the gene reported above the usage of the first exon of isoform 203 is calculated over the usage of exon 1 in isoform 201 and not over the sum of first exon of 201, 206+205, 204 and 202. As the denominator is smaller it is harder to reach significance. To me it looks like the definition of usage of first and last exon is made too simple to capture many splicing events which are presents. Is there a way to modify this?

The significance is tested at the level of differential splicing, and it could still be significant despite having a small denominator, unless you are using some expression cutoff.

We could try to include a variable number of exons between the two alternative exons to allow for these types of internal promoters. My worry is that for complex regions, this could end up producing an explosion of combinations that may obscure the interpretation of the result, whereas the isoform analysis would already be quite straightforward.

I hope this helps best


Thanks again!

— You are receiving this because you commented. Reply to this email directly, view it on GitHub, or mute the thread .

– Dr E Eyras

ICREA Research Professor Universitat Pompeu Fabra PRBB, Dr Aiguader 88 Tel: +34 93 316 0502 (ext 1502) E08003 Barcelona, Spain Fax: +34 93 316 0550

Read more comments on GitHub >

github_iconTop Results From Across the Web

a tool for extracting alternative splicing events from RNA-seq ...
In this study, we developed a tool, Alt Event Finder, which generates splicing event annotations from RNA-seq data. Most event-based analysis, ...
Read more >
EventPointer: An effective identification of ... - Bioconductor
EventPointer: An effective identification of alternative splicing events using junction arrays and RNA-Seq data.
Read more >
Identifying and Validating Alternative Splicing Events - Affymetrix
Identifying and Validating Alternative. Splicing Events. GeneChip® Exon Arrays are powerful tools for the discovery and study of mRNA transcript diversity.
Read more >
Detection of aberrant splicing events in RNA-seq data using ...
First (left column), a splice site map is generated in an annotation-free fashion based on RNA-seq split reads. Split reads supporting exon–exon ...
Read more >
Thousands of exon skipping events differentiate among ...
Using the system, we identified putative alternative splicing events in 16 different ... Because these resources were generated using Sanger sequencing, ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Post

No results found

github_iconTop Related Hashnode Post

No results found