[MITgcm-devel] yesterday changes in global_ocean.cs32x15.viscA4
Martin Losch
Martin.Losch at awi.de
Mon May 9 11:29:08 EDT 2016
Hi Jean-Michel,
here’s the content of the directory and the error message:
stan1:tr_run.viscA4> ls -l pickup*
lrwxrwxrwx 1 mlosch CLIDYN 26 May 13 2015 pickup.0000072000 -> ../input/pickup.0000072000
lrwxrwxrwx 1 mlosch CLIDYN 31 May 13 2015 pickup.0000072000.meta -> ../input/pickup.0000072000.meta
lrwxrwxrwx 1 mlosch CLIDYN 17 May 13 2015 pickup.0000086400 -> pickup.0000072000
lrwxrwxrwx 1 mlosch CLIDYN 38 May 13 2015 pickup.0000086400.meta -> ../input.viscA4/pickup.0000086400.meta
-rw-r--r-- 1 mlosch CLIDYN 7520256 May 8 06:46 pickup.ckptA.data
-rw-r--r-- 1 mlosch CLIDYN 379 May 8 06:46 pickup.ckptA.meta
-rw-r--r-- 1 mlosch CLIDYN 1474560 Jul 16 2006 pickup_nh.0000086400
lrwxrwxrwx 1 mlosch CLIDYN 39 May 14 2015 pickup_nh.0000086400.gz -> ../input.viscA4/pickup_nh.0000086400.gz
-rw-r--r-- 1 mlosch CLIDYN 737280 May 8 20:37 pickup_ph.0000086400
stan1:tr_run.viscA4> cat STDERR.0000 | tail -5
(PID.TID 0000.0001) WARNING >> READ_PICKUP: no field-list found
(PID.TID 0000.0001) WARNING >> try to read pickup as it used to be written
(PID.TID 0000.0001) WARNING >> until checkpoint59i (2007 Oct 22)
(PID.TID 0000.0001) *** ERROR *** MDS_READ_FIELD: filename: pickup_ph.0000086400.data
(PID.TID 0000.0001) *** ERROR *** MDS_READ_FIELD: File does not exist
So pickup_ph.00000864000 is there, but I also have this error message:
run_clean skipped!
linkdata from dirs: input.viscA4 input
ldir=input.viscA4: pickup_ph.0000086400.gz ;../input.viscA4/prepare_run: line 24: gunzip: command not found
unzip files: pickup_ph.0000086400 ;
ldir=input: eedata ; link files: from dir: ../../tutorial_held_suarez_cs/input
runmodel in global_ocean.cs32x15/tr_run.viscA4 ... failed (run: 1 end: 0 )
=> output from running in global_ocean.cs32x15/tr_run.viscA4 :
> STOP ABNORMAL END: S/R MDS_READ_FIELD
> ./mitgcmuv(lang:f90): signal trap(SIGTERM: Software termination)
> stan-004: mpid: MPI process (universe = 0, rank = 1) terminated by exit(1)
I am not sure what’s happening, running the experiment alone with testreport ( like this:
./testreport -t global_ocean.cs32x15 -MPI 2 -of=../tools/build_options/SUPER-UX_SX-ACE_sxf90_awi -small_f -runonly -command "mpirun -np TR_NPROC ./mitgcmuv” )
does not work (above error messages again), then I ran it directly (mpirun -np 2 ./mitgcmuv) which worked, then ran it again with testreport, and it worked again.
I am not sure how to debug this. Definitly “gunzip” does not work on the compute nodes of stan, maybe that’s the issue, because that’s the only place where gunzip is used?
Martin
> On 09 May 2016, at 16:14, Jean-Michel Campin <jmc at mit.edu> wrote:
>
> Hi Martin,
>
> It seems that the changes I made yesterday (most likely in
> verification/global_ocean.cs32x15/input.viscA4/prepare_run
> ) are causing problems for the test you run on stan1.
> The file: pickup_ph.0000086400 is not found but should have been
> gunzip when running "prepare_run"
>
> When you have time, if you could take a look and tell me what is
> wrong in "prepare_run", this would be nice.
>
> Thanks,
> Jean-Michel
>
> _______________________________________________
> MITgcm-devel mailing list
> MITgcm-devel at mitgcm.org
> http://mitgcm.org/mailman/listinfo/mitgcm-devel
More information about the MITgcm-devel
mailing list