MMSplice error and general kipoi questions
See original GitHub issueHello,
I’m quite new to kipoi, but I should praise your effort in advance for such great resource and concept.
I want to employ kipoi to study predictions of Splicing models. To start, I decided to go for MMSplice, but it seems my setup isn’t working. I followed recommendations by creating a specific environment for all MMSplcie modules
However, when I run the simplest case on a set of variants I get the following error:
(kipoi-MMSplice) pedro.barbosa@lobo-2:~/resources/ kipoi veff score_variants MMSplice/pathogenicity -i variants.vcf.gz -o test.vcf
Already up-to-date.
Using TensorFlow backend.
/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/lib/python3.5/site-packages/sklearn/externals/joblib/__init__.py:15: DeprecationWarning: sklearn.externals.joblib is deprecated in 0.21 and will be removed in 0.23. Please import this functionality directly from joblib, which can be installed with: pip install joblib. If this warning is raised when loading pickled models, you may need to re-serialize those models with scikit-learn 0.21+.
warnings.warn(msg, category=DeprecationWarning)
/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/lib/python3.5/site-packages/sklearn/base.py:306: UserWarning: Trying to unpickle estimator HuberRegressor from version 0.19.2 when using version 0.21.3. This might lead to breaking code or invalid results. Use at your own risk.
UserWarning)
/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/lib/python3.5/site-packages/sklearn/base.py:306: UserWarning: Trying to unpickle estimator StandardScaler from version 0.19.2 when using version 0.21.3. This might lead to breaking code or invalid results. Use at your own risk.
UserWarning)
/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/lib/python3.5/site-packages/sklearn/base.py:306: UserWarning: Trying to unpickle estimator LogisticRegression from version 0.19.2 when using version 0.21.3. This might lead to breaking code or invalid results. Use at your own risk.
UserWarning)
/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/lib/python3.5/site-packages/sklearn/base.py:306: UserWarning: Trying to unpickle estimator Pipeline from version 0.19.2 when using version 0.21.3. This might lead to breaking code or invalid results. Use at your own risk.
UserWarning)
2019-08-08 16:40:29.661278: I tensorflow/core/platform/cpu_feature_guard.cc:142] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
2019-08-08 16:40:29.668075: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2200050000 Hz
2019-08-08 16:40:29.668911: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x5657620 executing computations on platform Host. Devices:
2019-08-08 16:40:29.668935: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined>
2019-08-08 16:40:29.692338: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile.
/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/lib/python3.5/site-packages/keras/engine/saving.py:292: UserWarning: No training configuration found in save file: the model was *not* compiled. Compile it manually.
warnings.warn('No training configuration found in save file: '
/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/lib/python3.5/site-packages/keras/engine/saving.py:292: UserWarning: No training configuration found in save file: the model was *not* compiled. Compile it manually.
warnings.warn('No training configuration found in save file: '
/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/lib/python3.5/site-packages/keras/engine/saving.py:292: UserWarning: No training configuration found in save file: the model was *not* compiled. Compile it manually.
warnings.warn('No training configuration found in save file: '
Traceback (most recent call last):
File "/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/bin/kipoi", line 11, in <module>
sys.exit(main())
File "/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/lib/python3.5/site-packages/kipoi/__main__.py", line 105, in main
command_fn(args.command, sys.argv[2:])
File "/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/lib/python3.5/site-packages/kipoi_veff/__main__.py", line 11, in cli_main
kipoi_veff.cli.cli_main(command, raw_args)
File "/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/lib/python3.5/site-packages/kipoi_veff/cli.py", line 458, in cli_main
command_fn(args.command, raw_args[1:])
File "/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/lib/python3.5/site-packages/kipoi_veff/cli.py", line 192, in cli_score_variants
model_info = kipoi_veff.ModelInfoExtractor(model, Dl)
File "/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/lib/python3.5/site-packages/kipoi_veff/utils/generic.py", line 318, in __init__
self.seq_fields = _get_seq_fields(model_obj)
File "/home/pedro.barbosa/software/miniconda3/envs/kipoi-MMSplice/lib/python3.5/site-packages/kipoi_veff/utils/generic.py", line 451, in _get_seq_fields
raise Exception("Model does not support var_effect_prediction")
Exception: Model does not support var_effect_prediction
Any help ?
Additionally, I would like to have a seamless way to run multiple models within the same environment. I checked and pulled kipoi/models docker image, but I’m not sure if all models dependencies are solved there. Do you have any tutorial on how to run models/ score variants using docker containers ? That would be ideal, since it is the preferred way to run things in my cluster.
Last but not least, what’s the biggest difference between kipoi predict vs kipoi veff score_variants for a model that is designed to predict effects of genomic variants ?
Thanks in advance, Pedro Barbosa
Issue Analytics
- State:
- Created 4 years ago
- Comments:33 (16 by maintainers)
Top GitHub Comments
Dear @Avsecz ,
Testing now HAL on clinvar file gives me a parser error
My VCF contains SIFT and Polyphen annotations added by VEP (that add string characters to the corresponding fields), i don’t know if this could be the problem:
|tolerated(1)|benign(0.003)
Ah, that is useful, thanks. It makes sense now.
I checked for that. Both have the ‘chr’ strings, but the problem remains. I removed chr myself, and it appears to be working (VCF file is annotated with KV:kipoi:HAL:DIFF and KV:kipoi:HAL:rID fields). However, few variants are scored. I believe it is because HAL can’t predict arbitrary across the whole genome, right? Is that the reason I get the following warning?
no intervals found for b'/mnt/nfs/lobo/MCFONSECA-NFS/pedro.barbosa/resources/variants.vcf.gz' at None:107087180-107087339
I would like to know more about HAL. In the paper, they refer the model predicts the effect of variants (SNPs, indels) on different isoform usage from alternative splicing events (alternative 5’ , alternative 3’ and Exon skipping events). When I run
kipoi predict HAL
, which just requires a fasta and a gtf I get several predictions for each feature present in my gtf. They seem vague to me, apparently it predicts PSI of alternative Splice acceptor events, but the genomic coordinates (metadata/ranges/start and metadata/ranges/end columns in the tsv output) doesn’t seem to match exon/intron boundaries present in the gtf. What is HAL actually predicting here? Please find below an example of a transcript:In addition, in the model page (http://splicing.cs.washington.edu/) , they seem to provide utilities to predict variants that influence alternative 5’ss and exon skipping, but kipoi refers to 3’ss. Was that done on purpose in the yaml file ?
Thanks in advance, Pedro Barbosa