question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Unusual predicted structures from pretrained OpenFold on Pascal GPU

See original GitHub issue

This is most likely some kind of local configuration error, but I haven’t been able to pin down the cause. If anyone has encountered this behavior before or has an idea of what might be wrong based on these output structures, any hints would be greatly appreciated!

Expected behavior:

run_pretrained_openfold.py outputs predicted structures comparable to AlphaFold or OpenFold Colab output.

I expected a structure similar to this unrelaxed prediction from OpenFold Colab model_1 with finetuning_1.pt:
image

Actual behavior:

My run_pretrained_openfold.py predicted structures are not similar to AlphaFold or OpenFold Colab output.

Predictions from model_1 with finetuning_1.pt (unrelaxed in tan, relaxed in blue):
image

Predictions from model_1 with params_model_1.npz:
image

Predictions from model_1 with params_model_1.npz using alignments from ColabFold MMseqs2 (ColabFold had predicted a reasonable expected structure):
image

Context:

4 x NVidia 1080-TI GPUs Using CUDA 11.3 (if other system data is relevant I can find it)

input/short.fasta

>query
MAAHKGAEHHHKAAEHHEQAAKHHHAAAEHHEKGEHEQAAHHADTAYAHHKHAEEHAAQAAKHDAEHHAPKPH

Run command:

python3 run_pretrained_openfold.py \
    input \
    data/pdb_mmcif/mmcif_files/ \
    --output_dir output \
    --cpus 16 \
    --preset reduced_dbs \
    --uniref90_database_path data/uniref90/uniref90.fasta \
    --mgnify_database_path data/mgnify/mgy_clusters_2018_12.fa \
    --pdb70_database_path data/pdb70/pdb70 \
    --uniclust30_database_path data/uniclust30/uniclust30_2018_08/uniclust30_2018_08 \
    --bfd_database_path data/bfd/bfd_metaclust_clu_complete_id30_c90_final_seq.sorted_opt \
    --model_device "cuda:0" \
    --jackhmmer_binary_path $venv_bin_dir/jackhmmer \
    --hhblits_binary_path $venv_bin_dir/hhblits \
    --hhsearch_binary_path $venv_bin_dir/hhsearch \
    --kalign_binary_path $venv_bin_dir/kalign \
    --config_preset "model_1" \
    --openfold_checkpoint_path openfold/resources/openfold_params/finetuning_1.pt

Other configurations I tried, which produced similarly strange outputs:

  • Removing --openfold_checkpoint_path to just use the AlphaFold weights
  • Using --config_preset "model_1_ptm" with finetuning_ptm_2.pt
  • Using --use_precomputed_alignments with alignment results from a previous OpenFold output
  • Using --use_precomputed_alignments with .a3m results from ColabFold
  • Using full_dbs instead of reduced_dbs

Issue Analytics

  • State:closed
  • Created a year ago
  • Comments:11 (6 by maintainers)

github_iconTop GitHub Comments

1reaction
gahdritzcommented, Jul 18, 2022

Ah my bad I never added it to the config. You’ll have to disable use_memory_efficient_kernel manually in openfold/model/evoformer.py. There should only be one occurrence of it there; change the setting from not use_lma to False.

0reactions
epenningcommented, Aug 29, 2022

I’ve confirmed that this was resolved with the fix for https://github.com/aqlaboratory/openfold/issues/172! 👏

Read more comments on GitHub >

github_iconTop Results From Across the Web

aqlaboratory/openfold: Trainable, memory-efficient ... - GitHub
OpenFold can predict the structures of sequences with more than 4000 residues on a single A100, and even longer ones with CPU offloading....
Read more >
OpenFold: A PyTorch Reproduction Of DeepMind's AlphaFold
What is OpenFold? OpenFold is a PyTorch reproduction of DeepMind's AlphaFold, a machine learning model built to automatically process protein ...
Read more >
AlphaFold Performance - UCSF RBVI
PDB Proteins / unique Total amino acids GPU memory (GB) Nvidia GPU Time (hours) Co... 6IG9 1 / 1 3744 48 A40 35 U... 6YEJ...
Read more >
Supported Applications - SBGrid Consortium
It is extremely fast (115ns/day for DHFR using 1 GPU) and performs MD simulations on ... ligand binding site prediction from protein sequence...
Read more >
Uni-Fold: An Open-Source Platform for Developing ... - bioRxiv
Recent breakthroughs on protein structure prediction, namely AlphaFold, ... We compared Uni-Fold with AlphaFold(-Multimer) and OpenFold.
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found