[MITgcm-devel] cs510 on IBM p690

Dimitris Menemenlis menemenlis at sbcglobal.net
Wed Feb 14 11:42:59 EST 2007


> can you do a 27 CPU test on columbia

27-cpu will not work on columbia because of limited memory

54-cpu works fine

Could Martin's problem be an MPI buffer overflow, which is not caught by MPI 
implementation?  Or maybe a bug in the particular MPI library that he is using? 
  In running on Columbia we had to play around with a bunch of MPI variables 
For a 216-CPU configuration, we are using following environment variables:

limit descriptors unlimited
limit stacksize 2000m
limit coredumpsize 1
module load modules scsl.1.6.1.0 intel-comp.9.1.039 mpt.1.12.0.nas
setenv MPI_DSM_DISTRIBUTE
setenv MPI_BUFS_PER_PROC  512
setenv MPI_BUFS_PER_HOST  512
setenv MPI_MSGS_PER_HOST 2048
setenv MPI_MSGS_PER_PROC 1024
setenv MPI_MSG_RETRIES 5000
setenv KMP_STACKSIZE 1000m
setenv KMP_LIBRARY turnaround

Not sure in detail what all of the above do but they are needed both to speed up 
the code "and" to avoid MPI buffer overflows.

D.



More information about the MITgcm-devel mailing list