Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MPI (message passing interface) --- run several M2 processes in parallel #2129

Draft
wants to merge 37 commits into
base: development
Choose a base branch
from

Conversation

antonleykin
Copy link
Contributor

@antonleykin antonleykin commented Jun 3, 2021

We want to implement coarse parallelization via basic MPI routines --- collaborators are welcome.
See the commentary in packages/MPI.m2

To implement:

  • Basic message-passing functions in the kernel
  • Additional message-passing functions in the kernel
  • A package facilitating the use of the functions above

@mahrud
Copy link
Member

mahrud commented Apr 30, 2023

  • Here's the result of running the three MPI examples on Github Actions, built using CMake on macOS, and linked with MPICH installed via Homebrew.

  • example for normalToricRing(Ideal,Thing) fails, I'm not sure why.

  • M2-binary compiles on Ubuntu, but crashed when building tvalues.m2

    • might have something to do with Ubuntu's MPICH vs Homebrew's

@antonleykin
Copy link
Contributor Author

@DanGrayson and @mahrud, I pushed a commit that has a change that makes it possible to run M2 both with mpirun and without. It builds for me with autotools and cmake on my Ubuntu --- see
M2/BUILD/MPI/Makefile
but I'm not sure how alter the "actions" to check everything

@mahrud
Copy link
Member

mahrud commented May 1, 2023

I think you should wrap things like #include <mpi.h> with a macro that checks if MPI is being used.

(sorry about the close/reopen, that was a mis-click)

@mahrud mahrud closed this May 1, 2023
@mahrud mahrud reopened this May 1, 2023
@antonleykin
Copy link
Contributor Author

I think you should wrap things like #include <mpi.h> with a macro that checks if MPI is being used.

We probably should just require MPI at build time.
The basic functionality works and (unless it breaks anything anywhere in the non-MPI part) we should distribute the MPI-capable binaries in the next release.

The only current "fail" I see looks unrelated to MPI:
https://github.com/Macaulay2/M2/actions/runs/4853189961/jobs/8649075617?pr=2129#step:9:3871

@mahrud
Copy link
Member

mahrud commented May 1, 2023

Even if you want the standard distribution to be MPI-capable (which is going to be difficult with brew and potentially also debian, since typically MPI-capable binaries are distributed in separate packages), there should be a way to disable MPI, similar to TBB, OpenMP, Python, etc.

@antonleykin
Copy link
Contributor Author

Even if you want the standard distribution to be MPI-capable (which is going to be difficult with brew and potentially also debian, since typically MPI-capable binaries are distributed in separate packages), there should be a way to disable MPI, similar to TBB, OpenMP, Python, etc.

Fair enough. What would be the best way to omit MPI code with macros?

@mahrud
Copy link
Member

mahrud commented May 1, 2023

Inside main() there are several blocks wrapped in #if WITH_PYTHON ... #endif or #if PROFILING ... #endif, etc. I think a similar block like WITH_MPI in a few places would work.

Also, once MPI is a package and has its own tests, could you revert f1aa4e4?

@antonleykin
Copy link
Contributor Author

@mahrud, I've encapsulated all MPI-related code with macros, so ubuntu non-MPI builds go through... but I don't understand why the MPI build fails now.

@DanGrayson, could you glance at M2/configure.ac? The autotools non-MPI ubuntu build works... but I'm not sure how to complete this so that MPI works as well.

@mahrud
Copy link
Member

mahrud commented May 4, 2023

Tabs aren't allowed in yaml.

@antonleykin
Copy link
Contributor Author

@mahrud, I've encapsulated all MPI-related code with macros, so ubuntu non-MPI builds go through... but I don't understand why the MPI build fails now.

I'll need some help with cmake here... since cmake fails both on ubuntu and macOS in a similar fashion: the object file coming from d/M2lib.c is missing symbols "_MPI..."

Could it be that WITH_MPI is somehow undefined when d/M2lib.c is compiled?

@DanGrayson, could you glance at M2/configure.ac? The autotools non-MPI ubuntu build works... but I'm not sure how to complete this so that MPI works as well.

All seems fine now with the MPI build via autotools --- the problem above doesn't emerge.

@mahrud
Copy link
Member

mahrud commented May 5, 2023

Could it be that WITH_MPI is somehow undefined when d/M2lib.c is compiled?

Yes, sorry I should have added this when I added the WITH_MPI option.

I also cherry-picked a fix for Normaliz 3.10, which is on development (it might be helpful to rebase on top of development and force push).

@antonleykin
Copy link
Contributor Author

Everything is fine with openmpi, but there is an obstacle with mpich:
there is no analog ot variable OMPI_COMM_WORLD_RANK (used, e.g., in M2/Macaulay2/bin/main.cpp).
Is there any way to know if mpirun was used to execute the program?

I've replaced openmpi with mpich on my ubuntu and now M2 crashes (unless executed with mpirun).

@antonleykin antonleykin changed the base branch from master to development May 5, 2023 15:35
@mahrud
Copy link
Member

mahrud commented May 5, 2023

Looking online, one suggestion seems to be to use the function MPI_Comm_rank instead, is that the same?

@antonleykin
Copy link
Contributor Author

Looking online, one suggestion seems to be to use the function MPI_Comm_rank instead, is that the same?

Well, probing the environment variable was a way to determine if the execution started via mpirun. It looks like MPI builds successfully now on github... so perhaps I overlooked something on my local machine.

@antonleykin
Copy link
Contributor Author

Even if you want the standard distribution to be MPI-capable (which is going to be difficult with brew and potentially also debian, since typically MPI-capable binaries are distributed in separate packages), there should be a way to disable MPI, similar to TBB, OpenMP, Python, etc.

Let me ask the opinion of @d-torrance (at least on debian). Do you think there need to distribute the MPI version of M2 separately?
(So far, on ubuntu and macOS, there seems to be no need to have two binaries: the one we build can be executed with or without mpirun.)

@mahrud
Copy link
Member

mahrud commented May 6, 2023

Now that the tests pass, could you turn the MPI tests into either tests for the package or perhaps tests under Macaulay2/tests/MPI? (I think the former is better, unless you don't plan there to be an MPI package)

After this, we can remove this section:

- name: Run MPI test examples
if: matrix.compiler == 'MPI'
run: |
mpirun -np 4 ./M2 --script ../../Macaulay2/packages/MPI/example-homotopies.m2
mpirun -np 4 ./M2 --script ../../Macaulay2/packages/MPI/example2.m2
mpirun -np 4 ./M2 --script ../../Macaulay2/packages/MPI/master-worker-MPI.m2

@antonleykin
Copy link
Contributor Author

I had success building this on Fedora 34.
It still segfaults on Ubuntu.

@antonleykin
Copy link
Contributor Author

The recent attempt to have "master" interrupt "workers" doesn't succeed yet:
M2/Macaulay2/packages/MPI/test-interruption.m2

The interruption should occur when MPI_Probe detects the necessary message from master:
M2/Macaulay2/e/interrupted.cpp

BTW, there may be a bug in the build process: WITH_MPI doesn't seem to be defined when interrupted.cpp is compiled (that's why #ifdef is commented out).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants