question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Can't start Kubernetes Cluster

See original GitHub issue

Hi there!

Having issues starting a Kubernetes cluster using the dask-labextension and unsure if it’s a bug or some mis-configuration. The symptoms are very similiar to the Slurm issue here https://github.com/dask/dask-labextension/issues/78 . When creating a cluster using the UI it errors with

Cluster Start Error 'LocalCluster' object has no attribute 'scheduler'

I am able to create Kubernetes cluster using dask_kubernetes directly fine and scale it up and down etc, however looking to be able to do this through the UI if possible for ease of use. I have followed the documentation in terms of putting the the dask.yaml file in the appropriate directory and I know it’s being read as I have mis-configured it purposely and it error-ed out.

Any help would be really appreciated, I’ve tried to capture all of the relevant information below;

Conda Packages

dask                      2.5.0                    pypi_0    pypi
dask-core                 2.4.0                      py_0    conda-forge
dask-gateway              0.3.0                    py37_0    conda-forge
dask-kubernetes           0.9.2                      py_0    conda-forge
dask-labextension         1.0.3                    pypi_0    pypi
distributed               2.5.1                    pypi_0    pypi

Jupyter Lab Extensions

JupyterLab v1.0.4
Known labextensions:
   app dir: /opt/conda/share/jupyter/lab
        @jupyter-widgets/jupyterlab-manager v1.0.2  enabled  OK
        @jupyterlab/git v0.8.2  enabled  OK
        @jupyterlab/plotly-extension v1.0.0  enabled  OK
        dask-labextension v1.0.0  enabled  OK
        jupyter-leaflet v0.11.2  enabled  OK
        nbdime-jupyterlab v1.0.0  enabled  OK

dask.yaml

distributed:
  logging:
    bokeh: critical
  scheduler:
    work-stealing: True
    allowed-failures: 5
  admin:
    tick:
      limit: 5s

kubernetes:
  name: dask-test
  worker-template:
    spec:
      restartPolicy: Never
      containers:
      - args:
          - dask-worker
          - --nthreads
          - '1'
          - --no-bokeh
          - --memory-limit
          - "1GB"
          - --death-timeout
          - '60'
        image: daskdev/dask:latest
        name: dask-test
        resources:
          limits:
            cpu: 0.2
            memory: "1G"

labextension:
  factory:
    module: dask_kubernetes
    class: KubeCluster
    args: []
    kwargs: {}

Thanks very much.

Issue Analytics

  • State:closed
  • Created 4 years ago
  • Comments:9 (4 by maintainers)

github_iconTop GitHub Comments

1reaction
iwalmsleycommented, Oct 28, 2019

Just an update that using a real Kubernetes cluster (1.14.4) this does seem to be working fine.

Thanks for the great work.

0reactions
iwalmsleycommented, Oct 17, 2019

Same error as before with 0.10.0

distributed.nanny - ERROR - Timed out connecting Nanny '<Nanny: None, threads: 1>' to scheduler 'tcp://172.17.0.11:45533'
Traceback (most recent call last):
  File "/opt/conda/lib/python3.7/site-packages/distributed/nanny.py", line 309, in instantiate
    timedelta(seconds=self.death_timeout), self.process.start()
tornado.util.TimeoutError: Timeout
distributed.dask_worker - INFO - End worker

Edit: To confirm - it doesn’t work either using the Labextension or just the plain code above with 0.10.0, so I guess it’ll be worth me trying in a real Kubernetes cluster.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Troubleshooting kubeadm | Kubernetes
18 cannot join a cluster created by kubeadm v1.17. To workaround the issue you have two options: Execute kubeadm init phase bootstrap-token on...
Read more >
Unable to start kubernetes on docker desktop win 10 pro #3799
In the setting touch reset kubernetes cluster and restart docker. Enable kubernetes and wait, don 't select the other options.
Read more >
Kubernetes failed to start in docker desktop - How to resolve
How to resolve kubernetes failed to start in docker desktop? · 1. Firstly, click Docker.dmg · 2. Then, drag and drop Docker for...
Read more >
Unable to install Kubernetes, stuck on Starting state
Hello there, I am unable to install Kubernetes. Stuck on “Starting…” I've tried the following: Reset docker desktop and quit removed pki ......
Read more >
Docker for Windows stuck at "Kubernetes is Starting" after ...
Reset Docker to factory settings · Quit Docker · Set the KUBECONFIG environment variable to %USERPROFILE%\.kube\config · Restart Docker and enable ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found