question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Requirements for UKB GWAS

See original GitHub issue

To run a basic GWAS on UKB data, here are some of the operations we’ll need support for:

There may be a few more beyond that, but I think anything remaining should be reasonable with Xarray/Dask alone.

Issue Analytics

  • State:open
  • Created 3 years ago
  • Comments:9 (5 by maintainers)

github_iconTop GitHub Comments

2reactions
hammercommented, Sep 4, 2020

A phenotype normalization pipeline.

@eric-czech I just had a nice chat with @zietzm and @ntatonetti who are at Columbia and are experts in handling complex phenotypes and running many GWAS against them.

They’re interested in using sgkit and possibly contributing back, particularly on the phenotype side.

Would you be open to making https://github.com/related-sciences/ukb-gwas-pipeline-nealelab public soon and potentially working with @zietzm to factor the phenotype handling code into its own repo, maybe something like sgkit-pheno or phenokit?

1reaction
hammercommented, Jul 27, 2020

A phenotype normalization pipeline.

I’ve been thinking about this one too. I think we’re going to feel Dask’s poor handling of nested data when working with phenotypes, and I’d prefer to keep Spark out of this project as a dependency, so I think we put that code into a separate repo if we find we do need Spark.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Genome-wide association studies - GWAS Catalog
Studies are eligible for inclusion in the GWAS Catalog if they meet the following criteria: Include a primary GWAS analysis, defined as array-based...
Read more >
Details and considerations of the UK Biobank GWAS - Neale lab
Furthermore, under a basic 2x2 testing framework (i.e. case/control x allele A/allele B), a fully penetrant allele would require around 25 ...
Read more >
Selecting Primary UKB Round 2 Phenotypes for LDSR Analysis
The UKB Round 2 GWAS contains 11685 GWAS of 4236 unique phenotype codes (3011 PHESANT + 559 FinnGen + 633 ICD10 + 31...
Read more >
How to run GWAS from UK Biobank efficiently on Hail
initiate hail environment · import phenotype info · import genotype information · add mfi information to variants · varinat qc to get allele ......
Read more >
Comprehensive genomic analysis of dietary habits in UK ...
Dietary habit GWAS in UKB identifies 814 independent loci ... We have complied with all relevant ethical regulations for work with UKB and ......
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found