Allow defining custom MPI launchers for systems
See original GitHub issueThere are some systems I have access to where it’d be beneficial to use custom MPI launchers, different from the standard ones. Latest case I ran into is https://www.dur.ac.uk/icc/cosma/support/rockport/, where mpirun
should be called as mpirun $RP_OPENMPI_ARGS
, where $RP_OPENMPI_ARGS
is an environment variable set by a module, which has to be loaded when running the tests (for example by setting systems.partitions.modules
).
I may have time to work on this, but I need some guidance, especially in terms of what the API should be. In general, I want this to be system-specific (so to be customised in the system configuration), I don’t want to entangle tests with system-specific details.
Issue Analytics
- State:
- Created a year ago
- Comments:7 (7 by maintainers)
Top Results From Across the Web
MPIJob error · Issue #48 · kubeflow/mpi-operator - GitHub
When i use mpi-operator , meet a error. ... 2018-09-04T11:27:10.524007166Z one of the contrib/platform definitions for your system type.
Read more >Configuring ReFrame for Your Site - Read the Docs
ReFrame provides an easy and flexible way to configure new systems and new programming environments. By default, it ships with a generic local...
Read more >Best Practices for MPI Usage - IXPUG
TACC's universal MPI job starter on all the systems. • Supports multiple MPI stacks o Intel MPI: mpiexec.hydra o MVAPICH2: mpirun_rsh.
Read more >Introduction to Kubeflow MPI Operator and Industry Adoption
MPI Operator provides a common Custom Resource Definition (CRD) for defining a training job on a single CPU/GPU, multiple CPU/GPUs, ...
Read more >How to use Kubeflow and the MPI Operator on OpenShift
This blog post details an example of using Kubeflow and its MPI operator on top of OpenShift. You can find the supporting materials...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Yes, I also wasn’t expecting it to work but it seems it does, and that’s perfect for my use case. The important seems to be to register the launcher before using it. An unregistered launcher still throws a useful error message:
I have also been able to define a new launcher in the configuration. So we will relax the configuration schema and add a small tutorial on how to define a new custom launcher.