question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Multiple chromosomes

See original GitHub issue

@DomNelson and I were just discussing the possibility of adding support for multiple chromosomes to tskit. One possibility which seems like it might be a reasonably smooth path forward is the following:

  1. Add a chromosome table with an ID, name, length and metadata.
  2. Add a chromosome column to the edge table. Within a chromosome, coordinates must be from 0 to the chromosome’s length.
  3. Either deprecate the sequence_length property (probably best), or make it equal to the sum of the length of all chromosomes.

For things like trees() we could add an optional chromosome argument. If the tree sequence contains multiple chromosomes which would raise an error if it’s not specified. For tree sequences with a single chromosome, things would continue to work as now.

Any thoughts @petrelharp, @hyanwong, @bhaller, @molpopgen?

Issue Analytics

  • State:open
  • Created 4 years ago
  • Comments:32 (29 by maintainers)

github_iconTop GitHub Comments

2reactions
petrelharpcommented, Apr 23, 2019

In terms of simplification then, each chromosome would be simplified separately,

I think you all are thinking about this, but if you simplify each chromosome separately, we’ll have to do an extra “merge” step at the end to ensure that the node tables agree across chromosomes. It would be nice to be able to parallelize, though!

I also lean towards ‘no thanks’. An extra benefit of sticking all chromosomes end-to-end is that it makes plotting easier.

1reaction
jeromekellehercommented, Apr 18, 2019

I’m with @bhaller here on the ‘not right now, thanks’ assessment. The idea came up during a discussion and I wanted to get some feedback on what would be involved and how much hassle it would be. This isn’t a small change, so we definitely wouldn’t make it lightly, without considering performance implications and taking the impact on downstream users into account.

Let’s keep the issue open for discussion anyway though.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Chromosome Abnormalities Fact Sheet
Numerical Abnormalities: When an individual is missing one of the chromosomes from a pair, the condition is called monosomy. When an individual ...
Read more >
Chromosome dynamics in multichromosome bacteria - PMC
Bacteria with multiple chromosomes have been found in diverse prokaryotic phyla including Actinobacteria, Chloroflexi, Deinococcus-Thermus, ...
Read more >
Evolution of multiple sex-chromosomes associated ... - Nature
Our results suggest that the multiple sex-chromosome system evolved in a common ancestor as a result of dynamic genome reshuffling through ...
Read more >
B chromosomes of multiple species have intense evolutionary ...
One of the biggest challenges in chromosome biology is to understand the occurrence and complex genetics of the extra, non-essential ...
Read more >
Multiple Chromosomes in Bacteria: Low Level of Evolutionary ...
(2019) Multiple Chromosomes in Bacteria: Low Level of Evolutionary Constraint Drives the Rapid Genetic Divergence of Chromosome II. Advances in Microbiology, 9, ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found