question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Allow defining custom MPI launchers for systems

See original GitHub issue

There are some systems I have access to where it’d be beneficial to use custom MPI launchers, different from the standard ones. Latest case I ran into is https://www.dur.ac.uk/icc/cosma/support/rockport/, where mpirun should be called as mpirun $RP_OPENMPI_ARGS, where $RP_OPENMPI_ARGS is an environment variable set by a module, which has to be loaded when running the tests (for example by setting systems.partitions.modules).

I may have time to work on this, but I need some guidance, especially in terms of what the API should be. In general, I want this to be system-specific (so to be customised in the system configuration), I don’t want to entangle tests with system-specific details.

Issue Analytics

  • State:closed
  • Created a year ago
  • Comments:7 (7 by maintainers)

github_iconTop GitHub Comments

1reaction
giordanocommented, Jul 12, 2022

Yes, I also wasn’t expecting it to work but it seems it does, and that’s perfect for my use case. The important seems to be to register the launcher before using it. An unregistered launcher still throws a useful error message:

reframe: failed to initialize runtime: no such launcher: 'unregistered_launcher'
0reactions
vkarakcommented, Aug 5, 2022

I have also been able to define a new launcher in the configuration. So we will relax the configuration schema and add a small tutorial on how to define a new custom launcher.

Read more comments on GitHub >

github_iconTop Results From Across the Web

MPIJob error · Issue #48 · kubeflow/mpi-operator - GitHub
When i use mpi-operator , meet a error. ... 2018-09-04T11:27:10.524007166Z one of the contrib/platform definitions for your system type.
Read more >
Configuring ReFrame for Your Site - Read the Docs
ReFrame provides an easy and flexible way to configure new systems and new programming environments. By default, it ships with a generic local...
Read more >
Best Practices for MPI Usage - IXPUG
TACC's universal MPI job starter on all the systems. • Supports multiple MPI stacks o Intel MPI: mpiexec.hydra o MVAPICH2: mpirun_rsh.
Read more >
Introduction to Kubeflow MPI Operator and Industry Adoption
MPI Operator provides a common Custom Resource Definition (CRD) for defining a training job on a single CPU/GPU, multiple CPU/GPUs, ...
Read more >
How to use Kubeflow and the MPI Operator on OpenShift
This blog post details an example of using Kubeflow and its MPI operator on top of OpenShift. You can find the supporting materials...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found