[MITgcm-support] mnc pickups

jeff polton jpolton at ucsd.edu
Mon Sep 19 16:06:11 EDT 2005


Hi
In summary, I tried the new mnc files on 3 similar runs, which stopped 
after a number of hours running for 2 different reasons (one of which I 
don't understand). The diagnostics output files, however, all have the 
same time stamp as the error files.
I'll describe what happened.
"cvs -n update" told me I needed 2 files from pkg/mnc, so I got them:
$ cvs update mnc_cw_readwrite.template
$ cvs update mnc_cw_cvars.F

I restarted the runs from pickups, changing only chkptFreq (to 
something nonzero), nIter0 and nTimeSteps.

The 1st run to fail did so with the following error:
> forrtl: severe (66): output statement overflows record, unit -5, file Internal Formatted Write
> Image              PC                  Routine            Line        Source
> mitgcmuv           0x4000000000408ab0  Unknown               Unknown  Unknown
> mitgcmuv           0x400000000036d6c0  Unknown               Unknown  Unknown
> mitgcmuv           0x400000000036df30  Unknown               Unknown  Unknown
> mitgcmuv           0x40000000003cbe70  Unknown               Unknown  Unknown
> mitgcmuv           0x40000000000f44a0  Unknown               Unknown  Unknown
> mitgcmuv           0x40000000000d2890  Unknown               Unknown  Unknown
> mitgcmuv           0x40000000000cc2c0  Unknown               Unknown  Unknown
> mitgcmuv           0x400000000002b6c0  Unknown               Unknown  Unknown
> mitgcmuv           0x4000000000040190  Unknown               Unknown  Unknown
> mitgcmuv           0x400000000025eb60  Unknown               Unknown  Unknown
> mitgcmuv           0x4000000000278a50  Unknown               Unknown  Unknown
> mitgcmuv           0x40000000002fd330  Unknown               Unknown  Unknown
> mitgcmuv           0x40000000002fd6b0  Unknown               Unknown  Unknown
> mitgcmuv           0x4000000000213d50  Unknown               Unknown  Unknown
> mitgcmuv           0x4000000000004b50  Unknown               Unknown  Unknown
> libc.so.6.1        0x20000000003b2970  Unknown               Unknown  Unknown
and STDERR.0000:
> (PID.TID 0000.0001) *** ERROR *** NetCDF ERROR: Numeric conversion not representable

I had previously encountered this error and resolved it ("circumnavigated" 
might be more appropriate) by reducing the number of diagnostics in 
data.diagnostics. Clearly that was not the real problem, as the error 
has arisen again.

The other 2 runs both died later because I filled my 500GB limit! Doh! 
Interestingly, though, both made it further than the above run and both 
completed a successful pickup dump. The above failed run completed 
only a checkpoint dump.

Any advice on what the "Numeric conversion not representable" means 
would be most appreciated.

Thanks,

Jeff

On Sep 18, 2005, at 7:34 PM, Ed Hill wrote:

> On Sun, 2005-09-18 at 16:38 -0700, jeff polton wrote:
>> Hi, I noticed there was a previous discussion about problems of
>> restarting from mnc pickups but the resolution was not posted on the
>> list.
>> I also have this problem.
>
> Hi Jeff,
>
> Thank you for the bug report!
>
> You ran into one of two bugs in recently added MNC code.  They are now
> fixed and everything appears to work correctly for a few tests.  So
> please grab the latest from CVS and give it a try.
>
> thanks,
> Ed
>
> -- 
> Edward H. Hill III, PhD
> office:  MIT Dept. of EAPS;  Rm 54-1424;  77 Massachusetts Ave.
>              Cambridge, MA 02139-4307
> emails:  eh3 at mit.edu                ed at eh3.com
> URLs:    http://web.mit.edu/eh3/    http://eh3.com/
> phone:   617-253-0098
> fax:     617-253-4464
>
> _______________________________________________
> MITgcm-support mailing list
> MITgcm-support at mitgcm.org
> http://mitgcm.org/mailman/listinfo/mitgcm-support
>



