Improving pre-built DeepVariant binaries for conda packages
Hi all; Thanks for all the help getting an initial conda package in place for DeepVariant (#9) through bioconda.
I wanted to follow up with some suggestions that would help make the pre-built binaries more portable as part of this process, in order of helpfulness for portability:
- Currently the binaries need a recent kernel with GLIBC > 2.23 due to pre-built htslib and other libraries. Would it be possible to build the DeepVariant libraries on an older machine to allow a wider range of system support? We build on CentOS 6 in conda to provide wider compatibility.
- main.py in the zip files hardcodes python to use /usr/bin/python. Would it be possible to generalize this by using the python that the zip file gets called with (sys.executable)? I currently patch this in the conda build: https://github.com/bioconda/bioconda-recipes/blob/0a2d467d63d011015efeef4b644e985297b6b271/recipes/deepvariant/build.sh#L22
- This is currently built against numpy 1.13 and ideally we’d want to sync with CONDA_NPY (1.12: https://github.com/bioconda/bioconda-recipes/blob/0a2d467d63d011015efeef4b644e985297b6b271/scripts/env_matrix.yml#L9). I believe building against 1.12 would make it forward compatible.
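To illustrate the second point, here is a minimal sketch of what using the calling interpreter could look like. The helper name resolve_python and the fallback behaviour are my own assumptions for illustration, not what main.py in the zip files actually contains:

```python
import sys

# Path the issue describes as hardcoded in the zip's main.py.
HARDCODED_PYTHON = "/usr/bin/python"


def resolve_python(fallback=HARDCODED_PYTHON):
    """Return the interpreter running this code, or the fallback if unknown.

    Hypothetical helper: the name and the fallback are assumptions made
    for this sketch, not part of DeepVariant.
    """
    # sys.executable can be an empty string or None when Python cannot
    # determine its own path, so keep the old value as a last resort.
    return sys.executable or fallback
```

Inside a conda environment this would typically resolve to the environment's own python rather than the system one, which is the behaviour the build.sh patch is after.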
An alternative to points 1 and 3 is making it easier to build DeepVariant as part of the conda build process. The major blocker here is the clif dependency, which is difficult to build, and the pre-built binaries require unpacking into /usr. If we could make this relocatable and easier to install globally we could build with portable binaries and adjustable numpy as part of the bioconda preparation process.
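Until portable builds exist, a host can at least check up front whether its GLIBC is new enough for the pre-built binaries. This is a rough sketch using the standard library's platform.libc_ver(); the function names are hypothetical, and the 2.23 threshold in the usage note is simply the figure from the first point above:

```python
import platform


def parse_version(text):
    """Turn a dotted version string such as '2.23' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def glibc_at_least(required, found=None):
    """Return True/False on a glibc host, or None when glibc is not detected.

    Hypothetical helper for illustration; `found` can be passed explicitly
    to check a version string without probing the current machine.
    """
    if found is None:
        # libc_ver() returns a (lib, version) pair, with empty strings
        # when it cannot identify the C library.
        lib, found = platform.libc_ver()
        if lib != "glibc" or not found:
            return None
    # Compare as integer tuples so that 2.5 sorts before 2.23.
    return parse_version(found) >= parse_version(required)
```

Calling glibc_at_least("2.23") on the target machine gives a quick yes/no before installing; whether the real cutoff is inclusive or strictly greater would need checking against the actual binaries.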
Thanks again for all the help.
Issue Analytics
- State:
- Created 6 years ago
- Comments:41
Top GitHub Comments
We understand that building CLIF is a tall order for our users, and we are putting effort into making it easier.
Hi all, I know that this was not fully resolved, but keeping it open forever also doesn’t seem very effective. I’m going to close it for now, but please do feel free to comment here. I will continue to read and reply to anything here. If there are suggestions on how to re-engage this effort, also feel free to let me know.