[MITgcm-devel] testing with mpi
Jean-Michel Campin
jmc at ocean.mit.edu
Thu Nov 11 16:55:51 EST 2004
Hello Ed,
I have a suggestion regarding testreport and the testing with mpi:
generally (at least on myrinet cluster) the run always return
a zero output code (RETVAL=0), as if everything was OK, even when
the run stop immediately.
It means that in testreport (arround line 430):
> ( eval $COMMAND ) > run.log 2>&1
> RETVAL=$?
> if test "x$RETVAL" = x0 ; then
> tail run.log
> echo successful 1>&2
will not necessary catch a failure in running the model.
And in fact, it's what happens with fizhi-gridalt-hs in all
recent mpi testing on cg01.
Could we save also STDERR.000? (in $CDIR, in addition to Makefile_head
and summary.txt) so that we would have some idea about what's wrong.
And the advantage is that those files (STDERR.000?) are generally
very small (most of the time empty) so that it will not require
too much disk space.
Thanks,
Jean-Michel
More information about the MITgcm-devel
mailing list