question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

GatewayCluster with kubernetes backend fails to start after update to daskhub v4.5.4

See original GitHub issue

What happened: I received the following error: GatewayClusterError: Cluster 'adrastea.b4286778ea9b49f4b4264f982f5b278d' failed to start, see logs for more information. The logs suggest that the dask-scheduler command is missing an argument after --host, which looks intentional based on this code. Here are the logs: image

This occurred upon update from v4.5.3 of the daskhub chart to v4.5.4. Note that several other issues occurred related to jupyterhub. I eventually worked my way through those and ultimately just deleted and recreated our GKE cluster. That fixed these other issues (primarily related to authentication) but this one remains.

What you expected to happen: A working GatewayCluster object to be returned from the gateway.new_cluster() call

Minimal Complete Verifiable Example: I’d imagine a lot of the reproducibility depends on our specific GKE infrastructure and chart config, but the actual code that raises this bug is just

import dask_gateway

gateway = dask_gateway.GatewayCluster()
cluster = gateway.new_cluster()

Anything else we need to know?:

Environment:

  • GKE cluster
  • daskhub chart version: 4.5.4
  • image: custom image running the following
    • Dask version: 2.30.0
    • Python version: 3.8.6
    • Operating System: ubuntu
    • Install method (conda, pip, source): conda

Issue Analytics

  • State:closed
  • Created 3 years ago
  • Comments:9 (2 by maintainers)

github_iconTop GitHub Comments

1reaction
bolliger32commented, Nov 11, 2020

In the end, I just decided to go with the default scheduler and use our customized image just on the worker. Seems to work if the important packages are pinned to the same version across these images. I’m sure it wouldn’t take too much digging to figure out what was going on. Let me know if you think that would be helpful and I’m happy to give it a bit more effort. Otherwise, I think we can close this

0reactions
consideRatiocommented, Aug 30, 2021

Thanks for following up @bolliger32!

Read more comments on GitHub >

github_iconTop Results From Across the Web

Install on a Kubernetes Cluster - Dask Gateway
The Dask Gateway pods running on Kubernetes include the following: - api : The ... To install the Dask-Gateway Helm chart, run the...
Read more >
daskhub 4.5.4 · dask/dask - Artifact Hub
This chart will deploy the following. A standard Dask Gateway deployment using the Dask Gateway helm chart, configured to use JupyterHub for authentication....
Read more >
Topics tagged dask-gateway
Failing to look at gateway cluster when using k8s portforwarding ... Heterogeneous clusters (Kubernetes + HPC workers) with Dask Gateway?
Read more >
Deploy your own Hub - Documentation | Planetary Computer
Deploy JupyterHub and Dask Gateway using the daskhub Helm chart. ... If you're new to Azure, Kubernetes, or JupyterHub, then you should try...
Read more >
Feature request: Save results as CSV - Qarmin/Czkawka - IssueHint
This issue has been created since 2022-01-19. ... GatewayCluster with kubernetes backend fails to start after update to daskhub v4.5.4, 9, 2020-11-10 ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found