question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Intel MKL FATAL ERROR in 20.11 and 20.12

See original GitHub issue

Description A clear and concise description of what the bug is.

  • When trying to run a PyTorch script which was traced on CPU, the error: Intel MKL FATAL ERROR: Cannot load libmkl_avx2.so or libmkl_def.so is thrown. This is a CPU script that is working on 20.10, but fails on 20.11 and 20.12. After looking around a bit, I was able to get around that error by setting LD_PRELOAD="/opt/tritonserver/backends/pytorch/libmkl_avx2.so:/opt/tritonserver/backends/pytorch/libmkl_def.so", but now a different but similar error is thrown: Intel MKL FATAL ERROR: cannot load libmkl_vml_avx2.so or libmkl_vml_def.so..

When looking in the 20.12 container, I get nothing:

root@py-svc-triton-server-669bf765b-svvzm:/opt/tritonserver# find . -name '*libmkl_vml*'
root@py-svc-triton-server-669bf765b-svvzm:/opt/tritonserver# 

When looking in the 20.10 container, I can find one of the files from that error (which seems to be enough):

root@py-svc-triton-server-f744f4657-mqmk5:/opt/tritonserver# find . -name '*libmkl_vml*'
./lib/pytorch/libmkl_vml_def.so
root@py-svc-triton-server-f744f4657-mqmk5:/opt/tritonserver# 

Triton Information What version of Triton are you using?

  • Currently using 20.10 and trying to upgrade to 20.12 Are you using the Triton container or did you build it yourself?
  • Using the triton container

To Reproduce Steps to reproduce the behavior.

Describe the models (framework, inputs, outputs), ideally include the model configuration file (if using an ensemble include the model configuration file for that as well).

  • Using a cpu traced PyTorch script using torch==1.5.

Expected behavior A clear and concise description of what you expected to happen.

  • Expect a cpu trace PyTorch script which ran on 20.10 to continue working on 20.11 and 20.12.

Issue Analytics

  • State:closed
  • Created 3 years ago
  • Comments:11 (5 by maintainers)

github_iconTop GitHub Comments

1reaction
Debangshu132commented, Jan 6, 2021

Same issue here, does not work in 20.11 or 20.12 but works fine in 20.10

0reactions
HarmitMinhas96commented, Jan 13, 2021

@CoderHam Thank you for the fix! Sorry I was not able to get back with a reproducible example

Read more comments on GitHub >

github_iconTop Results From Across the Web

Intel MKL FATAL ERROR: Cannot load libmkl_intel_thread ...
This when I get the error listed above. Please help. This is the code file below: import pandas as pd from keras.models import ......
Read more >
Intel MKL FATAL ERROR: Cannot load libmkl_avx2.so or ...
Hi I have installed Fupermod software with MKL support. It installed correctally. But when I run the program it give me that error...
Read more >
[오류해결] INTEL MKL ERROR: 지정된 모듈을 찾을 수 ...
오류명 : INTEL MKL ERROR: 지정된 모듈을 찾을 수 없습니다. mkl_intel_thread.1.dll. Intel MKL FATAL ERROR: Cannot load mkl_intel_thread.1.dll.
Read more >
Package List — Spack 0.20.0.dev0 documentation
cachefilesd, perl-test-fatal, r-affyio ... intel-mkl, py-iso8601, r-tweenr ... 21.09, 21.08, 21.07, 21.06, 21.05, 21.04, 21.03, 21.02, 21.01, 20.12, 20.11, ...
Read more >
Intel MKL FATAL ERROR when running Anaconda projects in ...
Try to update mkl to 2018.0.2-1 or later will fix this problem. 0. Please sign in to ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found