question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

How to get the same MPI/OpenMP mapping using dask-jobqueue

See original GitHub issue

Hi everybody,

I am using the following script to run a hybrid (MPI/OpenMP) application in a Cluster (each node in the cluster has 20 physical cores) via SLURM

#!/bin/bash
JOB=458754
for nrun in {1..10}
do
for npop in 600
do
  for niter in 100
  do
     for boundary in reflecting
     do
SAIDA=$(sbatch -o "${nrun}runs${npop}x${niter}_${boundary}_gbest_canonical_cc_em_375x369_5000_2_2.txt" --ntasks=300 --cpus-per-task=2 --ntasks-per-node=10 --ntasks-per-socket=5 -p cluster128g --requeue ./script.sh $nrun $npop $niter $boundary 2)
    JOB=$SAIDA
    done
  done
done
done
echo "last job was $JOB

Thus, I map 10 MPI process on each node of the cluster, each of which spawns two OpenMP threads (10 MPI x 2 OpenMP threads= 20cores). I rewrote my application in python and I would like to use dask to parallelize it. However, I couldn’t map more than one worker in a node in my first attempts. How can I do that? I mean map more than one dask worker in the nodes such way I take advantage of all resources. I looked for some example trought the issues list but I could not find any. Some idea?

Thanks

Issue Analytics

  • State:closed
  • Created 5 years ago
  • Comments:14 (7 by maintainers)

github_iconTop GitHub Comments

1reaction
ofmlacommented, Apr 2, 2019

@guillaumeeb thanks for your readiness to support and assist and sorry for the late reply, I had a deadline lately which took my time. I am still confused about on dask-jobqueue operation and I am facing some issues. Anyway, I will perform some tests following your comment and let you know if it works.

0reactions
guillaumeebcommented, Aug 30, 2022

Closing this issue as stale. @ofmla if you ever work on this again, feel free to reopen.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Parallel Analysis with Dask - MPI-Met Wiki
Dask parallelise the processing of data by (1) dividing the arrays into chunks ... You can use Dask either in ipython or in...
Read more >
How to do composite parallelism (similar to MPI+OpenMP ...
I'm just learning Dask and understand it's application to embarassingly-parallel tasks. I have a function that reads data from a single file and ......
Read more >
Distributed computing using Python/Dask on the CC-IN2P3 ...
You will find below the instructions to get a simple python 3 virtual environment with all needed libraries to execute basic jobs using...
Read more >
Configure Dask-Jobqueue
To properly use Dask and Dask-Jobqueue on an HPC system you need to provide a bit of information about that system and how...
Read more >
Parallelisation — SWD6: High Performance Python
MapReduce (map a function to the data and reduce / summarise the output). ... You then connect to this Dask manager using a...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found