[MITgcm-support] crash with a new processor / grid size setup

Dimitris Menemenlis dmenemenlis at gmail.com
Fri Mar 27 20:12:01 EDT 2015


Pretty sure that nPy must be equal to 1 for cubed-sphere topologies.
Try nPx=68*18 (i.e. 1224), nPy=1.
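
A quick arithmetic check of the layout may also help. The sketch below is only an illustration: check_layout is a hypothetical helper, not part of MITgcm, and it assumes (this is not stated in the thread) that exch2 needs each 510-cell facet edge to be a whole multiple of the tile size sNx and sNy. The numbers are copied from the SIZE.h values quoted below.

```python
def check_layout(nPx, nPy, nSx, nSy, sNx, sNy, facet_edge=510, n_facets=6):
    """Hypothetical sanity check for a cubed-sphere tile layout.

    Assumption (not confirmed in the thread): every facet edge must be
    a whole multiple of the tile size in both directions.
    """
    n_tiles = nPx * nSx * nPy * nSy
    cells_in_tiles = n_tiles * sNx * sNy
    cells_on_cube = n_facets * facet_edge * facet_edge
    return {
        "n_tiles": n_tiles,
        "cell_count_matches": cells_in_tiles == cells_on_cube,
        "x_remainder": facet_edge % sNx,  # 0 means sNx divides the facet edge
        "y_remainder": facet_edge % sNy,  # 0 means sNy divides the facet edge
    }


# Layout from the original post: nPx=68, nPy=18, tiles of 75 x 17.
original = check_layout(68, 18, 1, 1, 75, 17)

# Layout suggested above: nPx=68*18, nPy=1, same tile size.
suggested = check_layout(68 * 18, 1, 1, 1, 75, 17)

# Illustrative alternative tile size with the same area (85*15 == 75*17).
alternative = check_layout(68 * 18, 1, 1, 1, 85, 15)

for name, result in [("original", original),
                     ("suggested", suggested),
                     ("alternative", alternative)]:
    print(name, result)
```

All three layouts give 1224 tiles and the right total cell count, but with sNx=75 the facet edge leaves a remainder of 60 (510 = 6*75 + 60). If the divisibility assumption holds, a tile size that divides 510 in both directions, such as the 85 x 15 used for illustration here, would be worth trying together with nPy=1.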

> On Mar 27, 2015, at 4:58 PM, Neil Patel <nigellius at gmail.com> wrote:
> 
> I recompiled my MITgcm setup (cube face 510 x 6 x 510) for a higher number of processors (1224 cores). I used these parameters in my SIZE.h:
> 
> (PID.TID 0000.0001)      nPx =   68 ; /* No. processes in X */
> (PID.TID 0000.0001)      nPy =   18 ; /* No. processes in Y */
> (PID.TID 0000.0001)      nSx =    1 ; /* No. tiles in X per process */
> (PID.TID 0000.0001)      nSy =    1 ; /* No. tiles in Y per process */
> (PID.TID 0000.0001)      sNx =   75 ; /* Tile size in X */
> (PID.TID 0000.0001)      sNy =   17 ; /* Tile size in Y */
> (PID.TID 0000.0001)      OLx =    4 ; /* Tile overlap distance in X */
> (PID.TID 0000.0001)      OLy =   4 ; /* Tile overlap distance in Y */
> (PID.TID 0000.0001)      nTx =    1 ; /* No. threads in X per process */
> (PID.TID 0000.0001)      nTy =    1 ; /* No. threads in Y per process */
> (PID.TID 0000.0001)       Nr =   50 ; /* No. levels in the vertical   */
> 
> and the program crashed with this error message:
> 
> ABNORMAL END: S/R W2_SET_MAP_TILES
> 
> Haven’t seen this one before. Anyone have any clue what to check? The number of grid cells matches the cube faces, and I told the computer I was running MITgcm with 1224 cores; I can’t think of what else I would need to change. Checking STDOUT.0000, these are the last lines output:
> 
> (PID.TID 0000.0001) // ======================================================
> (PID.TID 0000.0001) // Mapping of tiles to threads
> (PID.TID 0000.0001) // ======================================================
> (PID.TID 0000.0001) // -o- Thread   1, tiles (   1:   1,   1:   1)
> (PID.TID 0000.0001) 
> (PID.TID 0000.0001) W2_READPARMS: file data.exch2 not found
> (PID.TID 0000.0001) => use W2_EXCH2 default: regular 6-facets Cube
> (PID.TID 0000.0001) W2_useE2ioLayOut=    T ;/* T: use Exch2 glob IO map; F: use model default */
> (PID.TID 0000.0001) W2_mapIO        =  -1 ; /* select option for Exch2 global-IO map */
> (PID.TID 0000.0001) W2_printMsg     =  -1 ; /* select option for printing information */
> (PID.TID 0000.0001) ===== Start setting W2 TOPOLOGY:
> (PID.TID 0000.0001)  write to log-file: w2_tile_topology.0000.log
> 
> The tile log files exist, and this is what each one contains:
> 
> (PID.TID 0003.0001) W2_E2SETUP: number of Active Tiles =    1224
> (PID.TID 0003.0001) W2_E2SETUP: number of Blank Tiles  =       0
> (PID.TID 0003.0001) W2_E2SETUP: Total number of Tiles  =    1224
> (PID.TID 0003.0001) W2_SET_CS6_FACETS: preDefTopol=  3 selected
> (PID.TID 0003.0001) W2_SET_CS6_FACETS: facet-dims Unset; assume nRd=nGr=nBl=  510
> (PID.TID 0003.0001) W2_E2SETUP: Total number of Facets =       6
> (PID.TID 0003.0001) W2_SET_F2F_INDEX: index matrix for connected Facet-Edges:
> (PID.TID 0003.0001) W2_SET_MAP_TILES: tile mapping within facet and global Map:
> 
> Thanks,
> 
> Neil
> _______________________________________________
> MITgcm-support mailing list
> MITgcm-support at mitgcm.org
> http://mitgcm.org/mailman/listinfo/mitgcm-support



