How to get the same MPI/OpenMP mapping using dask-jobqueue
See original GitHub issueHi everybody,
I am using the following script to run a hybrid (MPI/OpenMP) application in a Cluster (each node in the cluster has 20 physical cores) via SLURM
#!/bin/bash
JOB=458754
for nrun in {1..10}
do
for npop in 600
do
for niter in 100
do
for boundary in reflecting
do
SAIDA=$(sbatch -o "${nrun}runs${npop}x${niter}_${boundary}_gbest_canonical_cc_em_375x369_5000_2_2.txt" --ntasks=300 --cpus-per-task=2 --ntasks-per-node=10 --ntasks-per-socket=5 -p cluster128g --requeue ./script.sh $nrun $npop $niter $boundary 2)
JOB=$SAIDA
done
done
done
done
echo "last job was $JOB
Thus, I map 10 MPI process on each node of the cluster, each of which spawns two OpenMP threads (10 MPI x 2 OpenMP threads= 20cores). I rewrote my application in python and I would like to use dask to parallelize it. However, I couldn’t map more than one worker in a node in my first attempts. How can I do that? I mean map more than one dask worker in the nodes such way I take advantage of all resources. I looked for some example trought the issues list but I could not find any. Some idea?
Thanks
Issue Analytics
- State:
- Created 5 years ago
- Comments:14 (7 by maintainers)
Top Results From Across the Web
Parallel Analysis with Dask - MPI-Met Wiki
Dask parallelise the processing of data by (1) dividing the arrays into chunks ... You can use Dask either in ipython or in...
Read more >How to do composite parallelism (similar to MPI+OpenMP ...
I'm just learning Dask and understand it's application to embarassingly-parallel tasks. I have a function that reads data from a single file and ......
Read more >Distributed computing using Python/Dask on the CC-IN2P3 ...
You will find below the instructions to get a simple python 3 virtual environment with all needed libraries to execute basic jobs using...
Read more >Configure Dask-Jobqueue
To properly use Dask and Dask-Jobqueue on an HPC system you need to provide a bit of information about that system and how...
Read more >Parallelisation — SWD6: High Performance Python
MapReduce (map a function to the data and reduce / summarise the output). ... You then connect to this Dask manager using a...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
@guillaumeeb thanks for your readiness to support and assist and sorry for the late reply, I had a deadline lately which took my time. I am still confused about on dask-jobqueue operation and I am facing some issues. Anyway, I will perform some tests following your comment and let you know if it works.
Closing this issue as stale. @ofmla if you ever work on this again, feel free to reopen.