[MITgcm-support] MPI problem on Archer (CRAY XC30)
David Ferreira
dfer at mit.edu
Mon Oct 20 10:14:51 EDT 2014
Hi all,
I'm having a problem running the coupled model on Archer.
The Archer support team discovered the following problem:
##########
...
---
Rank 0 [Tue Oct 7 22:44:10 2014] [c7-0c1s12n2] Fatal error in MPI_Recv:
Invalid
tag, error stack:
MPI_Recv(192): MPI_Recv(buf=0x7fffffff8000, count=1024, MPI_INTEGER,
src=1, tag=9862928,
comm=0x84000004, status=0x7fffffff7df0) failed
MPI_Recv(113): Invalid tag, value is 9862928
---
The maximum allowed value for the "tag" in MPI messages on Cray XC30 systems
is 4194303, which explains the error message
MPI_Recv(113): Invalid tag, value is 9862928
The MPI standard only requires that the maximum tag value is not less than
32767 (so Cray MPI is compliant). The maximum can be larger, and can be
found
using the MPI_Get_attr enquiry function. Your program should retrieve
this value
and ensure that any tags it specifies are no larger.
##########
I have absolutely no clue how to tell the model to choose MPI tags which
are below 4194303.
A bit of googling on MPI_Get_attr led me to some very obscure pages.
Any help is welcome.
Cheers,
david
More information about the MITgcm-support
mailing list