[MITgcm-support] changing number of processors
Angela Zalucha
azalucha at seti.org
Fri Feb 6 13:38:38 EST 2015
Hi,
I'm not sure why you would be getting NaN's, but I have found that there
is a trick to increasing the number of processors. I ran on a machine
that has 12 processes per node, and the highest number of processors I
could run was 1536 (I should point out that at high processor numbers, I
found the code to be less efficient, so if you have a limited amount of
processor hours, you might be better off running with fewer processors,
e.g.: the wall clock time difference between 768 and 1536 processors is
only a factor of 1.03).
Anyway, here is my SIZE.h parameters:
sNx=2
sNy=2
nSx=1
nSy=1
nPx=96
nPy=16
I have noticed during my scaling tests (and maybe someone can confirm my
explanations for this behavior that:
1) scaling tests on a 12 processors per node machine had faster wall
clock times for a 12 processor/node test than a 16 processor/node test,
I think owing to the the cube-sphere geometry having a "built-in" factor
of 6, and communication across cube faces gets strange when the number
of processors is not a factor of 6)
(this deeply saddens me because the 12 processor machine I used to use
was retired Jan. 1, and now I have to run on a 16 processor machine,
even this is the wave of the future, it hurts my efficiency)
2) sNx*nSx*nPx = 192 and sNy*nSy*nPy=32
3) For the same number of processors, faster wall clock times are
achieved when nSx and nSy are minimized.
I can produce tables and tables of configurations if you want, since at
low processors there is degeneracy between sNx,nSx,nPx and
sNy,nSy,nPy, respectively.
Angela
On 02/06/2015 08:45 AM, Jonny Williams wrote:
> Hi eveyrone
>
> I'm trying to run my regional model on 480 processors, up from a
> successfully working 48 procesor version.
>
> I have recompiled my code.
>
> To do this (in SIZE.h) I reduced sNy by a factor of 10 and increased nPy
> by a factor of ten so that nPx*nPy was increased by a factor of 10,
> which I think is the total number of processors.
>
> The executable was created fine and the model does run but the data I am
> getting out in my NetCDF files (mnc package) is all NaNs.
>
> Has anyone encountered this type of issue or know how to fix it?
>
> Is there a maximum number of processors?
>
> Many thanks
>
> Jonny
>
> --
> Dr Jonny Williams
> School of Geographical Sciences
> Cabot Institute
> University of Bristol
> BS8 1SS
>
> +44 (0)117 3318352
> jonny.williams at bristol.ac.uk <mailto:jonny.williams at bristol.ac.uk>
> http://www.bristol.ac.uk/geography/people/jonny-h-williams
> <http://bit.ly/jonnywilliams>
>
>
> _______________________________________________
> MITgcm-support mailing list
> MITgcm-support at mitgcm.org
> http://mitgcm.org/mailman/listinfo/mitgcm-support
>
--
=====================
Angela Zalucha, PhD
Research Scientist
SETI Institute
+1 (617) 894-2937
=====================
More information about the MITgcm-support
mailing list