[MITgcm-support] Problems with mpi on beowulf

Yuan Lian lian at U.Arizona.EDU
Fri Sep 17 23:16:35 EDT 2004


Hi,

I ran into a problem when running MITgcm with mpich on a Beowulf cluster.
Here are the error messages from the output:

>Warning: no access to tty (Bad file descriptor).
>Thus no job control in this shell.

This suggests that I didn't choose the proper shell. I was running the code
under csh, which is set in the job script with "#PBS -S /bin/csh".

>(PID.TID 0000.0001) *** ERROR *** S/R EESET_PARMS
>(PID.TID 0000.0001) *** ERROR *** Unable to open execution environment
>(PID.TID 0000.0001) *** ERROR *** parameter file "eedata"

This error message shows that the code can't read eedata (the default
folder was changed to my home directory instead of workdir!!). I defined
"usingMPI = .TRUE." and moved all programs under workdir to my home
directory. The code then runs, but the simulation results are
obviously wrong compared to a single-CPU run. Here are my questions:

1. How do I define the variables in eedata properly?
2. Do I still need joinmds to combine the outputs when I use mpirun to
   run the code? (It seems that all the outputs have already been combined.)
3. Has anybody successfully run the code on a Beowulf cluster? (Is the default
   compiler g77, or pgf77 with mpich?)
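
For reference, the working-directory symptom described above can be sketched as follows. PBS normally starts a job in the home directory and records the submission directory in PBS_O_WORKDIR, so a job script can change there before launching the model. This is only a sketch under assumptions: it is written in sh rather than the csh used in the job script, the names check_eedata and run_dir are hypothetical, and the mpirun line is left as a comment because the process count and options depend on the local mpich installation.

```shell
#!/bin/sh
# Hypothetical helper: succeed only if DIR contains a readable eedata file.
check_eedata() {
    dir=$1
    if [ -r "$dir/eedata" ]; then
        echo "eedata found in $dir"
        return 0
    fi
    echo "eedata missing in $dir" >&2
    return 1
}

# PBS sets PBS_O_WORKDIR to the directory qsub was called from;
# fall back to the current directory when run outside PBS.
run_dir=${PBS_O_WORKDIR:-$PWD}

if check_eedata "$run_dir"; then
    # Run from the directory that holds eedata and the other input files.
    cd "$run_dir" || exit 1
    # Assumed launch line (executable name and options are site-specific):
    # mpirun -np 2 ./mitgcmuv
fi
```

With a check like this, the model input files can stay in workdir instead of being moved to the home directory.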

Thanks!

Best regards,
Yuan


