[MITgcm-devel] [MITgcm-support] BLOCKING_EXCHANGES slowdown when using pkg/ptracers on Columbia

Holly Dail hdail at MIT.EDU
Fri Mar 12 09:25:31 EST 2010


Hello -

I found about a 50% slowdown on long runs on the Altix here at MIT  
with 7 tracers.

My runs were of a 120x108x23 N. Atlantic grid, 3600 sec time step on  
20 processors and using 7 tracers.  300 years of simulation took 60 h  
without tracers and about 91 h with.

Holly

On Mar 12, 2010, at Mar 12 , 2:29 AM, Martin Losch wrote:

> I agree with Jean-Michel, it is puzzling that even 1 ptracer is  
> enough to cause this problem (and in this case my hack won't do  
> anything).
>
> As I said, my solution is a hack, probably specific to the platform  
> I was using, that made my simulations (with 16 tracers) fast enough  
> so that I could continue to with the integration, I never bothered  
> to go into detail with this.
>
> Just as additional information. In my/our case we were runing on  
> JUMP, a IBM690/P4 cluster in Jülich (now offline) and observed bad  
> scaling with # of cpu, with the hack we were happy with the scaling  
> and did not investigate things any further (see here for a terrible  
> gray paper: <http://epic.awi.de/Publications/Los2008a.pdf> that  
> describes the problem).
>
> Martin
>
> On Mar 11, 2010, at 10:48 PM, Jean-Michel Campin wrote:
>
>> Hi,
>>
>> I think it would still be usefull to understand why Dimitris
>> see such a slow down, specially with only 1 tracer (and in this case,
>> Martin's solution, to copy back and forth to an array which has
>> exactly the same shape, is unlikely to speed up the run).
>> After, the 8 tracers case scale like the 1 tracer, and is easy to
>> interpret if we understand the 1 tracer case.
>>
>> Jean-Michel
>>
>> On Thu, Mar 11, 2010 at 05:30:43PM +0100, Martin Losch wrote:
>>> Hi Constantinos,
>>>
>>> I am moving this to the devel-list:
>>>
>>> I would have added the code long ago, if it had been clean, but it  
>>> isn't. Jean-Michel might even remove my cvs privileges if he only  
>>> _sees_ the code changes (o:
>>>
>>> I am happy to provide my changes (they are with very much outdated  
>>> code), and Dimitris can see if it works for him. Then we should  
>>> think about how to include this in a more general context, but  
>>> that's way beyond my technical (and currently even time)  
>>> capabilities.
>>>
>>> Martin
>>> PS. Here comes the code (between checkpoint59n and p).
>>>
>>
>>
>>>
>>>
>>> On Mar 11, 2010, at 5:02 PM, Constantinos Evangelinos wrote:
>>>
>>>> On Thursday 11 March 2010 05:52:23 am Martin Losch wrote:
>>>>
>>>>> I observed a huge slowdown with ptracers as well, the solution  
>>>>> was to copy
>>>>> ptracers into a new 5D-array where the 3rd index (k) is  
>>>>> nPtracer*Nr long,
>>>>
>>>> You mean 3D I take it.
>>>>
>>>>> and then use one exchange for that (and I think I had to hack the
>>>>> corresponding exchange routine to have it allow such large  
>>>>> arrays). Can
>>>>> provide code if required.
>>>>
>>>> Since more than one person has this problem it might be a good  
>>>> time to add the
>>>> code (as an IFDEF) in the main code.
>>>>
>>>> Constantinos
>>>> -- 
>>>> Dr. Constantinos Evangelinos
>>>> Department of Earth, Atmospheric and Planetary Sciences
>>>> Massachusetts Institute of Technology
>>>>
>>>>
>>>> _______________________________________________
>>>> MITgcm-support mailing list
>>>> MITgcm-support at mitgcm.org
>>>> http://mitgcm.org/mailman/listinfo/mitgcm-support
>>>
>>
>>> _______________________________________________
>>> MITgcm-devel mailing list
>>> MITgcm-devel at mitgcm.org
>>> http://mitgcm.org/mailman/listinfo/mitgcm-devel
>>
>>
>> _______________________________________________
>> MITgcm-devel mailing list
>> MITgcm-devel at mitgcm.org
>> http://mitgcm.org/mailman/listinfo/mitgcm-devel
>
>
> _______________________________________________
> MITgcm-devel mailing list
> MITgcm-devel at mitgcm.org
> http://mitgcm.org/mailman/listinfo/mitgcm-devel




More information about the MITgcm-devel mailing list