Installing dask and distributed packages can be confusing
See original GitHub issueOur installation docs recommend that people do the following
conda install dask distributed -c conda-forge
or
pip install dask[complete] distributed --upgrade
However we shouldn’t expect most people to do the proper diligence of reading installation docs. We all tend to just guess that conda install name-of-project
works pretty well most of the time. Unfortunately, if you’ve heard that Dask does distributed computing, you conda install dask
, and then try out any distributed example then you’re likely to receive an import error, which makes for a bad first impression.
There are a few ways that we could resolve this problem:
- We could provide informative errors whenever someone tries to do a
dask.distributed
thing. These would point them to installation docs. This wouldn’t help if the just didimport distributed
though I think that most of the public materials we produce at this point always import fromdask.distributed
. - We could switch out the conda package
dask
with a metapackage that included both dask and distributed. This would be foolproof in the conda case but would be a bit of an organizational hassle from a packaging perspective. We would rename the existing package dask-core (or something similar) and then switch in the dask metapackage. We would have to do this on conda-forge at the same time. - We could find some way within conda to have a cycle (dask includes distributed, distributed includes dask)
- Other suggestions?
I’m in favor of starting with option 1, though would love to find a more thorough alternative.
Issue Analytics
- State:
- Created 6 years ago
- Reactions:2
- Comments:27 (14 by maintainers)
Top Results From Across the Web
Installing dask and distributed packages can be confusing
Installing dask and distributed packages can be confusing. ... Our installation docs recommend that people do the following conda install dask distributed ...
Read more >Install Dask.Distributed
To install distributed from source, clone the repository from github: git clone https://github.com/dask/distributed.git cd distributed python -m pip install ...
Read more >Default pip installation of Dask gives "ImportError: No module ...
pip install dask : Install only dask, which depends only on the standard library. This is appropriate if you only want the task...
Read more >Dask - To Distribute or Not To Distribute..Ahh..This Thing Sucks.
Ok, so we are going to install Dask on my 4 node (Linode) cluster. ... environment setup and all your needed packages installed...
Read more >Set up a Dask Cluster for Distributed Machine Learning
Setting up a Dask cluster using SSH connections might seem like the easiest of the bunch, but it is also the most unstable,...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
We’ve all made that mistake at least once 😃
On Sun, Apr 29, 2018 at 6:09 AM, YorT notifications@github.com wrote:
Why is the Conda package for
dask.distributed
not calleddask.distributed
ordask-distributed
(what I would expect to install, but does not exist)? It’s calleddistributed
which is super-confusing and there is nothing helpful in the output ofconda info distributed -c conda-forge
. Further,distributed
does not express a dependency ondask
. Am I even looking at the right package?