Getting runtime error when trying to run on a GPU node

Please run microk8s inspect and attach the generated tarball to this issue.

MicroK8s v1.21

Error: Failed to create pod sandbox: rpc error: code = Unknown desc = failed to get sandbox runtime: no runtime for "nvidia" is configured

Troubleshooting:

  1. followed installation process here
  2. tested deployment with microk8s enable gpu

We appreciate your feedback. Thank you for using microk8s.
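
The error above means containerd was asked for a runtime handler named "nvidia" that its configuration does not define. As a rough first check, the sketch below greps a containerd config file for such an entry. The function name `has_nvidia_runtime` is hypothetical, and the default path is an assumption about where the MicroK8s snap keeps its containerd config; pass another path if yours differs.

```shell
# Hypothetical helper: succeed if the given containerd config defines a
# runtime table named "nvidia". The default path is an assumption about the
# MicroK8s snap layout, not something stated in the issue.
has_nvidia_runtime() {
  config="${1:-/var/snap/microk8s/current/args/containerd.toml}"
  if [ ! -r "$config" ]; then
    echo "cannot read $config" >&2
    return 2
  fi
  # Match a TOML table header ending in runtimes.nvidia] (name optionally quoted).
  grep -Eq 'runtimes\."?nvidia"?\]' "$config"
}
```

Calling `has_nvidia_runtime && echo configured || echo missing` on the GPU node gives a quick yes/no. A "missing" result would be consistent with the error, but it does not say why the entry is absent.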

Issue Analytics

  • State: closed
  • Created 2 years ago
  • Comments: 5 (2 by maintainers)

Top GitHub Comments

1 reaction
joedborg commented, Jul 30, 2021

You’re welcome @inohmonton99, thanks for raising the issue. Yes, MicroK8s will detect whether or not you have drivers already installed and configure the NVIDIA operator accordingly.

1 reaction
inohmonton99 commented, Jul 29, 2021

Everything’s working now. Thanks for the quick solution, @joedborg. Also, with latest/edge I didn’t need to install the NVIDIA drivers; I only needed to enable gpu and it installed everything. Awesome!
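
The sequence described in this comment can be sketched roughly as follows. The `run` wrapper and the `enable_gpu_on_edge` function are hypothetical names added for illustration; with `DRY_RUN=1` (the default in this sketch) the commands are only printed. It assumes MicroK8s was installed as a snap and that moving to the `latest/edge` channel is acceptable on your machine.

```shell
# With DRY_RUN=1 (default here) each command is printed instead of executed.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

enable_gpu_on_edge() {
  # Move the snap to the latest/edge channel mentioned in the thread.
  run sudo snap refresh microk8s --channel=latest/edge
  # Enable the GPU addon, the same command used in the issue.
  run microk8s enable gpu
  # List pods in all namespaces to watch the GPU components start.
  run microk8s kubectl get pods --all-namespaces
}
```

Run `enable_gpu_on_edge` to preview the commands, or `DRY_RUN=0 enable_gpu_on_edge` to execute them.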

