Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

mpirun needs "--bind-to none" specified if used for multiple executions in parallel #147

Open
3 tasks
cchall opened this issue Apr 25, 2023 · 2 comments
Open
3 tasks

Comments

@cchall
Copy link
Member

cchall commented Apr 25, 2023

There is significant slowdown observed that scales up as the number of simultaneous executions of mpirun.
Adding --bind-to none see seems to mostly resolve the issue.

  • Check if this issue exists as you add more cores to each execution
  • Look at MPI specific executor settings for rsopt. Would probably want to only use this for openmpi. I'm not aware of any similar problem ever occurring on MPICH systems.
  • Could be motivation to revisit a true Serial executor. I think there was talk of putting this into libEnsemble. Though it is difficult since we may want to accommodate a serial and MPI executor simultaneously.
@robnagler
Copy link
Member

Sirepo always passes --bind-to-none. I don't think we've had issues with mpich.

@cchall
Copy link
Member Author

cchall commented Apr 25, 2023

Thanks @robnagler thats good to know

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants