[MITgcm-support] mpi run with cpu more than 9999

Martin Losch Martin.Losch at awi.de
Tue Mar 10 11:59:52 EDT 2020


Hi Daquan,

I have no experience with so many processors. I assume that there will be plenty of problems with order 1e4 files open (depending on your file system), but you can fix the “*****” problem by changing the definitions in eesupp/src/eeboot_minimal.F:
Look for “USE_PDAF” to see how the names of STDERR and STDOUT are changed to have longer numbers and do something similar for the default case.

Alternatively you can define SINGLE_DISK_IO, but then only process 0 (0000) will write a STDOUT/STDERR pair.

Martin

> On 10. Mar 2020, at 12:52, Daquan Guo <Daquan.Guo at kaust.edu.sa> wrote:
> 
> Dear mitgcm community and developers,
> 
> I am running a case with 16650 cpus and facing a problem. 
> It seems the files scratch.*, STDERR.* and STDOUT.* can not be written well if the number exceeds 9999, instead it generates one file named 'scratch.****', which can not be read and processed then the model crashed. 
> I am wondering if anyone has experience on this and knows how to fix it?
> Thanks in advance.
> 
> Best,
> Daquan
> 
>   
> 
> 
> 
> 
> This message and its contents, including attachments are intended solely for the original recipient. If you are not the intended recipient or have received this message in error, please notify me immediately and delete this message from your computer system. Any unauthorized use or distribution is prohibited. Please consider the environment before printing this email._______________________________________________
> MITgcm-support mailing list
> MITgcm-support at mitgcm.org
> http://mailman.mitgcm.org/mailman/listinfo/mitgcm-support



More information about the MITgcm-support mailing list