CuPy MPI demo is failing for Bcast


Description

I’m trying to run the CuPy MPI demo. The Allreduce example works, but the Bcast and point-to-point (p2p) examples fail with a segmentation fault:

Caught signal 11 (Segmentation fault: invalid permissions for mapped object at address 0x2b15b6a00310)

Env

  1. openmpi/4.0.1 built with CUDA support
  2. cuda/10.2.89
  3. mpi4py/3.1.0a0
  4. cupy-cuda102/8.5.0
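
One way to double-check the first item is to ask Open MPI itself whether it was compiled with CUDA support. The sketch below is only an illustration, not something taken from the issue thread: it assumes `ompi_info` is on the PATH and that its parsable output contains a line ending in `mpi_built_with_cuda_support:value:true` or `...:false`. The helper names `built_with_cuda_support` and `query_open_mpi` are hypothetical.

```python
import subprocess


def built_with_cuda_support(ompi_info_output: str) -> bool:
    """Return True if parsable ompi_info text reports CUDA support.

    Assumption: the relevant line looks like
    mca:mpi:base:param:mpi_built_with_cuda_support:value:true
    """
    key = "mpi_built_with_cuda_support:value:"
    for line in ompi_info_output.splitlines():
        if key in line:
            # The flag is the last colon-separated field on the line.
            return line.strip().rsplit(":", 1)[-1] == "true"
    # No such line found: treat the build as not CUDA-aware.
    return False


def query_open_mpi() -> bool:
    """Run ompi_info (assumed to be on PATH) and parse its output."""
    result = subprocess.run(
        ["ompi_info", "--parsable", "--all"],
        capture_output=True,
        text=True,
        check=True,
    )
    return built_with_cuda_support(result.stdout)
```

Calling `query_open_mpi()` on the machine that runs the demo returns `True` only when the flag is reported as `true`. A `True` result confirms the build option, but it does not by itself explain or rule out the segmentation fault.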

Additional

I’ve tried running the example on a single GPU and on multiple GPUs (manually selecting the device based on the rank). I get the same error in both cases.
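
For readers who want to reproduce the multi-GPU setup described above, here is a minimal sketch of the Bcast step with a per-rank device. It is modelled on the Bcast part of the mpi4py CuPy demo and is not a fix for the reported crash. The helper names `device_for_rank` and `bcast_demo` are hypothetical, and the round-robin mapping uses the global rank, which is an assumption that only makes sense when all ranks run on one node.

```python
def device_for_rank(rank: int, n_devices: int) -> int:
    """Map an MPI rank to a GPU index round-robin (illustrative choice)."""
    if n_devices < 1:
        raise ValueError("at least one CUDA device is required")
    return rank % n_devices


def bcast_demo(n: int = 100) -> None:
    """Broadcast a CuPy array from rank 0 using a CUDA-aware MPI build."""
    # Imported here so the mapping helper can be used without MPI or a GPU.
    from mpi4py import MPI
    import cupy as cp

    comm = MPI.COMM_WORLD
    rank = comm.Get_rank()

    # Select the device before allocating any GPU memory on this rank.
    n_devices = cp.cuda.runtime.getDeviceCount()
    cp.cuda.Device(device_for_rank(rank, n_devices)).use()

    if rank == 0:
        buf = cp.arange(n, dtype=cp.complex64)
    else:
        buf = cp.empty(n, dtype=cp.complex64)

    # Make sure the GPU has finished writing the buffer before MPI reads it.
    cp.cuda.get_current_stream().synchronize()
    comm.Bcast(buf, root=0)

    assert cp.allclose(buf, cp.arange(n, dtype=cp.complex64))
```

To try it, add a call to `bcast_demo()` at the bottom of a script and launch it with something like `mpiexec -n 2 python script.py`, where the script name is a placeholder.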

Issue Analytics

  • State: closed
  • Created 2 years ago
  • Comments: 10 (10 by maintainers)

Top GitHub Comments

1 reaction
lebedov commented, Apr 11, 2021

I’m not @lefarov

0 reactions
leofang commented, Apr 11, 2021

Oops sorry @lebedov, I meant to ping @lefarov but GitHub autocomplete messed up.


