[MITgcm-support] old pickups, new model version -- fails on first timestep with STOP in CALC_R_STAR

Rose, Brian brose at albany.edu
Wed Nov 5 10:48:45 EST 2014


Sorry, I just realized that the checkpoint is clearly labelled in the STDOUT files that I have from the old runs:

(PID.TID 0000.0001) // MITgcmUV version:  checkpoint61t
(PID.TID 0000.0001) // Build user:        brose
(PID.TID 0000.0001) // Build host:        compute-1-24.local
(PID.TID 0000.0001) // Build date:        Thu Sep 30 16:17:04 EDT 2010

Bear with me! I'm a little rusty on my MITgcm skills!
Thanks
Brian

On Nov 5, 2014, at 10:42 AM, Rose, Brian <brose at albany.edu<mailto:brose at albany.edu>> wrote:

Hi Jean-Michel!

To answer your questions:
1) Yes, it's the ocean. The error appears in STDERR files for several (not all) of the ocean ranks
2) Good question. The checkpoint information might be lost forever, as everything was on beagle.darwinproject.mit.edu<http://beagle.darwinproject.mit.edu/>, which has been taken offline. Looking back at my notes, I first started setting up and running the coupled model on beagle around August 2009.
3) The content of one pickup*.meta file and the data file (both for the ocean component) appended below.

Thanks for any suggestions!
- Brian

[br546577 at headnode Ocn]$ cat pickup.0424224000.001.001.meta
 nDims = [   2 ];
 dimList = [
   144,    1,   12,
    24,    1,   24
 ];
 dataprec = [ 'float64' ];
 nrecords = [   183 ];
 timeStepNumber = [  424224000 ];
 /* modelTime = [  1.527206400000E+12 ];*/
 nFlds = [   15 ];
 fldList = {
 'Uvel    ' 'Vvel    ' 'Theta   ' 'Salt    ' 'GuNm1   ' 'GuNm2   ' 'GvNm1   ' 'GvNm2   ' 'TempNm1 ' 'TempNm2 ' 'SaltNm1 ' 'SaltNm2 ' 'EtaN    ' 'dEtaHdt ' 'EtaH    '
 };


[br546577 at headnode rank_1]$ cat data
# ====================
# | Model parameters |
# ====================
#
# Continuous equation parameters
 &PARM01
 tRef=16.5, 14.0, 13.5, 13.0, 12.0,
      10.0,  6.7,  4.0,  2.2,  1.0,
       0.2, -0.3, -0.7, -1.1, -1.4,
 sRef=15*35.,
 viscAh =3.E5,
 viscAr =1.E-3,
 bottomDragLinear=1.E-3,
 diffKhT=0.,
 diffK4T=0.,
 diffKrT=3.E-5,
 diffKhS=0.,
 diffK4S=0.,
 diffKrS=3.E-5,
 gravity=9.81,
 rhonil=1030.,
 rhoConstFresh=1000.,
 eosType='JMD95Z',
#allowFreezing=.TRUE.,
 ivdc_kappa=10.,
 implicitDiffusion=.TRUE.,
 implicitFreeSurface=.TRUE.,
 exactConserv=.TRUE.,
 select_rStar=2,
 nonlinFreeSurf=4,
 hFacInf=0.2,
 hFacSup=2.0,
 useRealFreshWaterFlux=.TRUE.,
 convertFW2Salt=34.,
 temp_EvPrRn=0.,
 hFacMin=.1,
 hFacMinDr=20.,
 vectorInvariantMomentum=.TRUE.,
 staggerTimeStep=.TRUE.,
 readBinaryPrec=64,
 writeBinaryPrec=64,
#debugLevel=0,
#tempAdvScheme=20,
#saltAdvScheme=20,
 &

# Elliptic solver parameters
 &PARM02
 cg2dMaxIters=200,
 cg2dTargetResidual=1.E-9,
#cg2dTargetResWunit=1.E-14,
 &

# Time stepping parameters
 &PARM03
 nIter0=424224000,
 nTimeSteps=172800,
 pChkptFreq=622080000.,
 taveFreq=0.,
 dumpFreq=622080000.,
 monitorFreq=2592000.,
 deltaTmom=3600.,
 deltaTtracer=3600.,
 deltaTClock =3600.,
#abEps=0.12,
#alph_AB=0.62,
#beta_AB=0.,
 alph_AB=0.5,
 beta_AB=0.281105,
 doAB_onGtGs=.FALSE.,
 forcing_In_AB=.FALSE.,
 momDissip_In_AB=.FALSE.,
 periodicExternalForcing=.TRUE.,
 externForcingPeriod=2592000.,
 externForcingCycle=31104000.,
 pickupStrictlyMatch=.TRUE.,
 &

# Gridding parameters
 &PARM04
 usingCurvilinearGrid=.TRUE.,
#horizGridFile='dxC1_dXYa',
#delR= 50., 70., 100., 140., 190.,
#      240., 290., 340., 390., 440.,
#      490., 540., 590., 640., 690.,
 delR= 30.,  40.,  60.,  80., 110.,
      140., 160., 200., 220., 260.,
      280., 310., 340., 370., 400.,
 &

# Input datasets
 &PARM05
 bathyFile      ='Ridge.c24.Bathy.3p0km.bin',
 hydrogThetaFile='Rg3p0_Cpl289T_C24_z15.bin',
 hydrogSaltFile ='Rg3p0_Cpl289S_C24_z15.bin',
 &


On Nov 5, 2014, at 8:50 AM, Jean-Michel Campin <jmc at ocean.mit.edu<mailto:jmc at ocean.mit.edu>> wrote:

Hi Brian,

The way MITgcm is able to restart, even after some code changes affecting
the content of pickup files, is by checking the content of all pickup*.meta

Now in your case:
1) can you confirm that it's the ocean component that is stopping ?
2) how old was your original code (date ? checkpoint ?)
3) what is the content of 1 of these oceanic pickup*.meta file ?
  and your main parameter file "data" ?
Since it's stopping even before starting the 1rst iteration (i.e.,
during initialisation), might not be too difficult to find what is
wrong.

Cheers,
Jean-Michel

On Wed, Nov 05, 2014 at 02:22:14AM +0000, Rose, Brian wrote:
Hi MITgcmers,
I'm trying to revive some calculations from several years ago. The model setup is coupled ocean + AIM atmosphere + thsice on C24 grid, as used in a number of papers by Ferreira, Rose, Marshall, etc.
I have all pickup and configuration files from these old runs, but unfortunately no longer have access to the machine I used to run them on.

So, I have ported an up-to-date version of MITgcm to our new cluster here at U. Albany. I successfully built and ran the test case
MITgcm/verification/cpl_aim+ocn
But when I try to set up the C24 coupled model as close to my previous runs as possible and initialize with my previous pickup files, the ocean model crashes on the first time step with errors like this:

fail at i,j=   1  25 ; rStarFacC,H,eta = -3.398999  3.000000E+03 -1.319700E+04
fail at i,j=   1  25 ; rStarFacS,H,eta = -1.730615  3.000000E+03 -6.998053E+00 -1.319700E+04
fail at i,j=   3  25 ; rStarFacC,H,eta =**********  3.000000E+03 -8.601127+273
fail at i,j=   3  25 ; rStarFacS,H,eta =**********  3.000000E+03 -7.290877E+00 -8.601127+273
fail at i,j=   5  25 ; rStarFacC,H,eta =**********  3.000000E+03 -4.028258E+08
fail at i,j=   5  25 ; rStarFacS,H,eta =**********  3.000000E+03 -8.277906E+00 -4.028258E+08
fail at i,j=   6  25 ; rStarFacC,H,eta =**********  3.000000E+03 -3.952771+291
fail at i,j=   6  25 ; rStarFacS,H,eta =**********  3.000000E+03 -9.155281E+00 -3.952771+291
fail at i,j=   8  25 ; rStarFacC,H,eta =**********  3.000000E+03 -5.011502E+48
fail at i,j=   8  25 ; rStarFacS,H,eta =**********  3.000000E+03 -1.126187E+01 -5.011502E+48
fail at i,j=  11  25 ; rStarFacC,H,eta =**********  3.000000E+03 -8.416418+231
fail at i,j=  11  25 ; rStarFacS,H,eta =**********  3.000000E+03 -1.384315E+01 -8.416418+231
fail at i,j=  13  25 ; rStarFacC,H,eta =**********  3.000000E+03 -2.545287E+63
WARNING: r*FacC < hFacInf at       7 pts : bi,bj,Thid,Iter=   1   1   1        -1
WARNING: r*FacS < hFacInf at       6 pts : bi,bj,Thid,Iter=   1   1   1        -1
STOP in CALC_R_STAR : too SMALL rStarFac[C,W,S] !

I'm not sure where to begin in debugging this.
I found a few references to similar-sounding problems in the support archives, but no clear answers. Someone suggested that this problem could arise from heavy sea ice loading. That is certainly the case here. The pickup files are from a very cold simulation with some tens of meters of sea ice. However this never caused problems before.

Any hints or suggestions?
Thanks,
Brian

|--------------------------------------------------------------------------------------------------------------------------------------|
 Brian E. J. Rose
    Assistant Professor
    Atmospheric & Environmental Sciences
    University at Albany

    Earth Sciences 315
    (518) 442-4477
    http://www.atmos.albany.edu/facstaff/brose/
|--------------------------------------------------------------------------------------------------------------------------------------|


_______________________________________________
MITgcm-support mailing list
MITgcm-support at mitgcm.org<mailto:MITgcm-support at mitgcm.org>
http://mitgcm.org/mailman/listinfo/mitgcm-support


_______________________________________________
MITgcm-support mailing list
MITgcm-support at mitgcm.org<mailto:MITgcm-support at mitgcm.org>
http://mitgcm.org/mailman/listinfo/mitgcm-support

|--------------------------------------------------------------------------------------------------------------------------------------|
  Brian E. J. Rose
     Assistant Professor
     Atmospheric & Environmental Sciences
     University at Albany

     Earth Sciences 315
     (518) 442-4477
     http://www.atmos.albany.edu/facstaff/brose/
|--------------------------------------------------------------------------------------------------------------------------------------|

_______________________________________________
MITgcm-support mailing list
MITgcm-support at mitgcm.org<mailto:MITgcm-support at mitgcm.org>
http://mitgcm.org/mailman/listinfo/mitgcm-support

|--------------------------------------------------------------------------------------------------------------------------------------|
  Brian E. J. Rose
     Assistant Professor
     Atmospheric & Environmental Sciences
     University at Albany

     Earth Sciences 315
     (518) 442-4477
     http://www.atmos.albany.edu/facstaff/brose/
|--------------------------------------------------------------------------------------------------------------------------------------|

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mitgcm.org/pipermail/mitgcm-support/attachments/20141105/7808d2b1/attachment-0001.htm>


More information about the MITgcm-support mailing list