⇤ ← Revision 1 as of 2010-04-27 21:17:06
2151
Comment:
|
2150
|
Deletions are marked like this. | Additions are marked like this. |
Line 41: | Line 41: |
Category MagparBugConfirmed | CategoryMagparBugConfirmed |
Description
This is not strictly a magpar bug but a problem with MPICH2 (version 1.2 or later), which affects parallel runs on multiple machines.
Steps to reproduce
Prepare parallel magpar run on Linux cluster using MPICH2 >= 1.2
- Start mpd ring spanning several machines using "mpdboot" command
- mpdboot command does not return (hangs)
Example and Details
Workaround
Follow the instructions at the bottom of this page in the MPICH2 trac system.
or edit/patch mpd.py directly according to this changeset
or download this version of mpd.py and use it instead of the mpd.py installed by MPICH2 >=1.2.
or use older MPICH2 version 1.0.8p
Plan
- Priority: Medium
- Assigned to: MPICH2 developers
Status: fixed in MPICH2 revision 5923
Wait for new MPICH2 release, then update magpar's Makefile.libs to default to new (fixed) MPICH2 version.