generateEvents misses splicing events
See original GitHub issueDear SUPPA developers,
We want to analyse data from Arabidopsis with your nice tool. The gene 'AT1G45474 have splicing events but they are not detected by SUPPA.
Do you have any idea why ?
Thanks a lot for your help.
Our call:
python3 /bin/suppa/suppa.py generateEvents -i SUPPA.gtf -o generateEvents.gtf --event-type SE SS MX RI FL --exon-length 5 --boundary V --threshold 1
SUPPA.gtf ( only the rows for AT1G45474)
1 TAIR10 exon 17179302 17179537 0 + . gene_id "AT1G45474"; transcript_id "AT1G45474.2"
1 TAIR10 exon 17179610 17179742 0 + . gene_id "AT1G45474"; transcript_id "AT1G45474.2"
1 TAIR10 exon 17179828 17179940 0 + . gene_id "AT1G45474"; transcript_id "AT1G45474.2"
1 TAIR10 exon 17180021 17180217 0 + . gene_id "AT1G45474"; transcript_id "AT1G45474.2"
1 TAIR10 exon 17180297 17180439 0 + . gene_id "AT1G45474"; transcript_id "AT1G45474.2"
1 TAIR10 exon 17180701 17180806 0 + . gene_id "AT1G45474"; transcript_id "AT1G45474.2"
1 TAIR10 exon 17179302 17179537 0 + . gene_id "AT1G45474"; transcript_id "AT1G45474.1"
1 TAIR10 exon 17179610 17179742 0 + . gene_id "AT1G45474"; transcript_id "AT1G45474.1"
1 TAIR10 exon 17179828 17179940 0 + . gene_id "AT1G45474"; transcript_id "AT1G45474.1"
1 TAIR10 exon 17180021 17180217 0 + . gene_id "AT1G45474"; transcript_id "AT1G45474.1"
1 TAIR10 exon 17180297 17180530 0 + . gene_id "AT1G45474"; transcript_id "AT1G45474.1"
Issue Analytics
- State:
- Created 7 years ago
- Comments:7 (4 by maintainers)
Top Results From Across the Web
a tool for extracting alternative splicing events from RNA-seq ...
In this study, we developed a tool, Alt Event Finder, which generates splicing event annotations from RNA-seq data. Most event-based analysis, ...
Read more >EventPointer: An effective identification of ... - Bioconductor
EventPointer: An effective identification of alternative splicing events using junction arrays and RNA-Seq data.
Read more >Identifying and Validating Alternative Splicing Events - Affymetrix
Identifying and Validating Alternative. Splicing Events. GeneChip® Exon Arrays are powerful tools for the discovery and study of mRNA transcript diversity.
Read more >Detection of aberrant splicing events in RNA-seq data using ...
First (left column), a splice site map is generated in an annotation-free fashion based on RNA-seq split reads. Split reads supporting exon–exon ...
Read more >Thousands of exon skipping events differentiate among ...
Using the system, we identified putative alternative splicing events in 16 different ... Because these resources were generated using Sanger sequencing, ...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Dear Claes,
thanks a lot for your email. Looking at the exon coordinates of your gene, I can see that transcript .1 has the same exons as transcript .2 except for the 3’ end exon, which is a bit longer. Also, the 3’ end exons from .2 is missing in .1. Is that correct?
This type of variation is not considered in the definitions by SUPPA. As the variable exon in .1 is a terminal exon, its end coordinate cannot be used as an alternative splice-site. Regarding a possible alternative polyA, it is true that is a perfectly possible variation but SUPPA only considers the case when the alternative 3’ end exon is different, but share a common upstream 5’ splice-site.
Let me know if that is what you meant. Thanks for using SUPPA
E.
Hi,
On Thu, Oct 19, 2017 at 6:07 PM, Lilly88 notifications@github.com wrote:
it can include all the others that conform to a definition of two alternatives. But in the case that alternative first exons it is true that it selects the isoform that gives the closest alternative exon.
There is a large number of alternative first exons (generic) and not all of them end up described as events in SUPPA. You probably get many more from isoforms for that reason. Also, they are generally very abundant genome-wide compared with other types of variations, at least in human.
We could try to include a variable number of exons between the two alternative exons to allow for these types of internal promoters. My worry is that for complex regions, this could end up producing an explosion of combinations that may obscure the interpretation of the result, whereas the isoform analysis would already be quite straightforward.
I hope this helps best
Eduardo
– Dr E Eyras
ICREA Research Professor Universitat Pompeu Fabra PRBB, Dr Aiguader 88 Tel: +34 93 316 0502 (ext 1502) E08003 Barcelona, Spain Fax: +34 93 316 0550
http://scholar.google.com/citations?user=LiojlGoAAAAJ http://www.researcherid.com/rid/L-1053-2014 http://regulatorygenomics.upf.edu/