Description
This is not strictly a magpar bug but a problem with MPICH2 (versions 1.2 and 1.2.1) that affects parallel runs on multiple machines.
Steps to reproduce
- Prepare a parallel magpar run on a Linux cluster using MPICH2 1.2 or 1.2.1
- Start an mpd ring spanning several machines using the "mpdboot" command
- Result: the mpdboot command does not return (hangs)
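To tell a hang apart from a slow start, the command can be run under a timeout. The following is only a minimal sketch: the helper name command_hangs, the 30 second default, and the host count and hosts file name in the usage comment are assumptions for illustration and do not come from magpar or MPICH2.

```python
import subprocess


def command_hangs(cmd, timeout_s=30.0):
    """Return True if cmd is still running after timeout_s seconds.

    cmd is an argument list such as ["mpdboot", "-n", "4", "-f", "mpd.hosts"].
    subprocess.run() kills the child process when the timeout expires.
    A missing executable raises FileNotFoundError, which is not caught here.
    """
    try:
        subprocess.run(
            cmd,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s,
        )
    except subprocess.TimeoutExpired:
        return True  # did not return in time: treat as a hang
    return False  # returned (successfully or not) within the timeout


# Hypothetical usage on the cluster head node (host count and file name
# are assumptions, adjust them to the local setup):
#   if command_hangs(["mpdboot", "-n", "4", "-f", "mpd.hosts"]):
#       print("mpdboot did not return; see the workarounds")
```

The helper only reports whether the command returned in time. It does not clean up any mpd processes that may already have been started on remote machines.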
Example and Details
Workaround
Use any one of the following:
- Use MPICH2 version 1.2.1p1 or later
- Follow the instructions at the bottom of this page in the MPICH2 trac system
- Edit/patch mpd.py directly according to this changeset
- Download this version of mpd.py and use it instead of the mpd.py installed by MPICH2 >=1.2
- Use the older MPICH2 version 1.0.8p1
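A job script could check the installed MPICH2 version before trying to start the mpd ring. The sketch below is an illustration only: it flags exactly the two releases named in this report (1.2 and 1.2.1), and the function names and the accepted version format (digits separated by dots, with an optional "pN" patch suffix) are assumptions. How the version string is obtained is left to the caller.

```python
import re


def parse_version(text):
    """Parse 'X.Y', 'X.Y.Z' or 'X.Y.ZpN' into (major, minor, micro, patch).

    Missing parts count as 0, so '1.2' and '1.2.0' compare equal.
    Other formats (for example release candidates) raise ValueError.
    """
    match = re.fullmatch(r"(\d+)\.(\d+)(?:\.(\d+))?(?:p(\d+))?", text.strip())
    if match is None:
        raise ValueError(f"unrecognised version string: {text!r}")
    return tuple(int(group) if group else 0 for group in match.groups())


# Only the releases named in this report are treated as affected.
AFFECTED = {parse_version("1.2"), parse_version("1.2.1")}


def is_affected(text):
    """Return True if the version is one of the releases listed above."""
    return parse_version(text) in AFFECTED


# Example: is_affected("1.2.1") is True, is_affected("1.2.1p1") is False.
```

An exact match is used instead of a version range because this report names only these two releases as affected.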
Plan
- Priority: Medium
- Assigned to: MPICH2 developers