[MITgcm-support] Problems with mpi on beowulf
Yuan Lian
lian at U.Arizona.EDU
Fri Sep 17 23:16:35 EDT 2004
Hi,
I encountered a problem when I ran MITgcm with mpich on a beowulf cluster.
Here is the error message from output:
>Warning: no access to tty (Bad file descriptor).
>Thus no job control in this shell.
This seems like I didn't choose the proper shell. I was running the code
under csh which was defined in script file as "#PBS -S /bin/csh".
>(PID.TID 0000.0001) *** ERROR *** S/R EESET_PARMS
>(PID.TID 0000.0001) *** ERROR *** Unable to open execution environment
>(PID.TID 0000.0001) *** ERROR *** parameter file "eedata"
This error message shows that the code can't read eedata, (the default
folds was changed to my home directory instead of workdir!!)I defined
"usingMPI = .TRUE." and moved all programs under workdir to home
directory, then code will run, however the results of simulation are
obviously wrong comparing to single cpu. Here are the questions:
1. How to define the variables in eedate properly?
2. Do I still need joinmds to incorporate the ouputs when I use mpirun to
run the code?(it seems that all the outputs has been incorporated)
3. Does anybody successfully run the code on Beowulf cluster?(the default
compiler is g77 or pgf77 with mpich?
Thanks!
Best regards,
Yuan
More information about the MITgcm-support
mailing list