Multiple chromosomes
@DomNelson and I were just discussing the possibility of adding support for multiple chromosomes to tskit. One possibility that seems like it might be a reasonably smooth path forward is the following:
- Add a `chromosome` table with an ID, `name`, `length` and `metadata`.
- Add a `chromosome` column to the edge table. Within a chromosome, coordinates must be from 0 to the chromosome's length.
- Either deprecate the `sequence_length` property (probably best), or make it equal to the sum of the lengths of all chromosomes.
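To make the proposal concrete, here is a minimal sketch of the data model in plain Python. It is not tskit code: `ChromosomeRow`, `EdgeRow`, `validate_edges` and `total_sequence_length` are hypothetical names invented for illustration, and the real tables would be columnar rather than lists of rows.

```python
from dataclasses import dataclass


@dataclass
class ChromosomeRow:
    """Hypothetical row of the proposed chromosome table."""
    id: int
    name: str
    length: float
    metadata: bytes = b""


@dataclass
class EdgeRow:
    """Hypothetical edge row carrying the proposed chromosome column."""
    left: float
    right: float
    parent: int
    child: int
    chromosome: int


def validate_edges(chromosomes, edges):
    """Check that each edge lies within [0, length] of its own chromosome."""
    lengths = {c.id: c.length for c in chromosomes}
    for e in edges:
        if e.chromosome not in lengths:
            raise ValueError(f"edge refers to unknown chromosome {e.chromosome}")
        if not (0 <= e.left < e.right <= lengths[e.chromosome]):
            raise ValueError(
                f"edge [{e.left}, {e.right}) is outside chromosome {e.chromosome}"
            )


def total_sequence_length(chromosomes):
    """One option from the list above: the sum of all chromosome lengths."""
    return sum(c.length for c in chromosomes)
```

Under this sketch, coordinates restart at 0 on every chromosome, so an edge is only meaningful together with its `chromosome` value.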
For things like `trees()` we could add an optional `chromosome` argument. If the tree sequence contains multiple chromosomes, this would raise an error if the argument is not specified. For tree sequences with a single chromosome, things would continue to work as now.
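The argument handling described here could look roughly like the following. This is only a sketch of the proposed behaviour: `resolve_chromosome` is a hypothetical helper, not an existing tskit function, and the default of chromosome 0 for the single-chromosome case is an assumption.

```python
def resolve_chromosome(num_chromosomes, chromosome=None):
    """Return the chromosome ID a trees()-style call should iterate over.

    Hypothetical helper: with a single chromosome the argument may be
    omitted; with several, omitting it is an error.
    """
    if chromosome is None:
        if num_chromosomes != 1:
            raise ValueError(
                "chromosome must be specified when the tree sequence "
                "contains multiple chromosomes"
            )
        return 0  # assumed ID of the only chromosome
    if not 0 <= chromosome < num_chromosomes:
        raise ValueError(f"chromosome {chromosome} out of range")
    return chromosome
```

Existing callers that never pass `chromosome` would keep working on single-chromosome tree sequences, which is the backward-compatibility property the proposal asks for.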
Any thoughts @petrelharp, @hyanwong, @bhaller, @molpopgen?
Issue Analytics
- State:
- Created 4 years ago
- Comments:32 (29 by maintainers)
I think you all are thinking about this, but if you simplify each chromosome separately, we’ll have to do an extra “merge” step at the end to ensure that the node tables agree across chromosomes. It would be nice to be able to parallelize, though!
I also lean towards ‘no thanks’. An extra benefit of sticking all chromosomes end-to-end is that it makes plotting easier.
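For comparison, the end-to-end alternative mentioned in this comment amounts to a simple coordinate mapping. The sketch below is an illustration in plain Python, with hypothetical helper names, of how per-chromosome positions could be mapped onto one concatenated coordinate system and back; it does not use any tskit API.

```python
from bisect import bisect_right
from itertools import accumulate


def chromosome_offsets(lengths):
    """Start position of each chromosome when laid end-to-end."""
    return [0, *accumulate(lengths)][:-1]


def to_global(offsets, chromosome, position):
    """Map (chromosome, position) to a concatenated coordinate."""
    return offsets[chromosome] + position


def to_local(offsets, global_position):
    """Map a concatenated coordinate back to (chromosome, position)."""
    chromosome = bisect_right(offsets, global_position) - 1
    return chromosome, global_position - offsets[chromosome]
```

With this layout, `sequence_length` keeps its current meaning as the total length, and a plot over the whole genome needs no special handling beyond marking the offsets as chromosome boundaries.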
I’m with @bhaller here on the ‘not right now, thanks’ assessment. The idea came up during a discussion and I wanted to get some feedback on what would be involved and how much hassle it would be. This isn’t a small change, so we definitely wouldn’t make it lightly, without considering performance implications and taking the impact on downstream users into account.
Let’s keep the issue open for discussion anyway though.