Azure Autoscaler - Permission denied connecting to Docker socket - Head Node Setup
See original GitHub issueRay RLLib Version 2.0.0.dev0
When running ray up
for an Azure deployment, the following occurs:
[5/7] Initalizing command runner
Shared connection to 20.71.92.41 closed.
Got permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Post http://%2Fvar%2Frun%2Fdocker.sock/v1.40/images/create?fromImage=rayproject%2Fray-ml&tag=latest-gpu: dial unix /var/run/docker.sock: connect: permission denied
Shared connection to 20.71.92.41 closed.
New status: update-failed
!!!
SSH command failed.
!!!
Failed to setup head node.
Issue Analytics
- State:
- Created 3 years ago
- Comments:15 (11 by maintainers)
Top Results From Across the Web
How to fix docker: Got permission denied while trying to ...
I've just installed docker but I have to run it with sudo every time. If I don't add sudo I get the following...
Read more >Errors connecting to Docker socket (#2408) - gitlab-runner
Summary I am receiving errors connecting to the Docker socket during the start ... Running with gitlab-ci-multi-runner dev (HEAD) on canny ...
Read more >How to Fix Docker Permission Denied Error on Ubuntu
Fix 1: Run all the docker commands with sudo If you have sudo access on your system, you may run each docker command...
Read more >azure - Unable to connect to the server: dial tcp: lookup ...
The error is coming because of private cluster. The Private Cluster option is enabled while creating the AKS cluster. You need to disable...
Read more >Head node fails to ssh into worker nodes
High: It blocks me to complete my task. Hello, I'm having a bit of trouble getting the cluster to work on Azure. The...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Might be related to https://github.com/ray-project/ray/issues/13711
Update: adding the recommended
sudo usermod -aG docker $USER; sleep 10
toinitialization_commands
worked!Ok! Then it’s just a matter of configuring those correctly.
I got it running here now! Can definitely do a
.yaml
update once I am able to run an experiment then if that’s ok? Currently stuck due to #14666