[MITgcm-support] Coupled model running!

taimaz.bahadory taimaz.bahadory at mun.ca
Wed Oct 3 14:51:49 EDT 2012


There is a "stdout" file generated in the main directory of run, with these
contents:

***********************************************************************************************************************************
CMA: unable to get RDMA device list
librdmacm: couldn't read ABI version.
librdmacm: assuming: 4
librdmacm: couldn't read ABI version.
librdmacm: assuming: 4
CMA: unable to get RDMA device list
librdmacm: couldn't read ABI version.
librdmacm: assuming: 4
CMA: unable to get RDMA device list
--------------------------------------------------------------------------
[[9900,1],2]: A high-performance Open MPI point-to-point messaging module
was unable to find any relevant network interfaces:

Module: OpenFabrics (openib)
  Host: glacdyn

Another transport will be used instead, although this may result in
lower performance.
--------------------------------------------------------------------------
 CPL_READ_PARAMS: nCouplingSteps=           5
 runoffmapFile =>>runOff_cs32_3644.bin<<= , nROmap=  3644
 ROmap:    1  599  598 0.100280
 ROmap: 3644 4402 4403 0.169626
[glacdyn:04864] 2 more processes have sent help message
help-mpi-btl-base.txt / btl:no-nics
[glacdyn:04864] Set MCA parameter "orte_base_help_aggregate" to 0 to see
all help / error messages
***********************************************************************************************************************************

Maybe there would be some relation between these error-like messages and
run stuck!





On Wed, Oct 3, 2012 at 2:13 PM, taimaz.bahadory <taimaz.bahadory at mun.ca>wrote:

> Re-Hi;
>
> Yes; as I said, it stuck again. I check the CPU. It is fully loaded, but
> the output file is not updated! It is only a few seconds younger than the
> run initiation.
>
>
>
> On Wed, Oct 3, 2012 at 1:32 PM, taimaz.bahadory <taimaz.bahadory at mun.ca>wrote:
>
>> Hi;
>>
>> I guess I've tried it too, but the same problem occurred (I will try it
>> again right now to check it again).
>> Will report soon
>> Thanks
>>
>>
>>
>> On Wed, Oct 3, 2012 at 1:29 PM, Jean-Michel Campin <jmc at ocean.mit.edu>wrote:
>>
>>> Hi Taimaz,
>>>
>>> Can you try without MNC ? The current set-up (cpl_aim+ocn) does not
>>> use MNC (useMNC=.TRUE., is commented out in both input_atm/data.pkg
>>> and input_ocn/data.pkg) so if there was a problem in the coupled set-up
>>> code
>>> with NetCDF output, might not have seen it (since I did not try recently
>>> with it).
>>>
>>> Cheers,
>>> Jean-Michel
>>>
>>> On Tue, Sep 25, 2012 at 11:54:59AM -0230, taimaz.bahadory wrote:
>>> > Hi everybody;
>>> >
>>> > I'm trying to run the coupled model example (cpl_aim+ocn) in the
>>> > verification directory. All the three first steps (Cleaning; Compiling
>>> and
>>> > Making; Copying input files) passed with no error; but when I run the
>>> > coupler, it starts and creates the netCDF output files initially, but
>>> stops
>>> > updating them and also the output files, although the three "mitgcmuv"
>>> > files are still running! It's like a program freezing.
>>> > Has anybody been stuck in such a situation?
>>>
>>> > _______________________________________________
>>> > MITgcm-support mailing list
>>> > MITgcm-support at mitgcm.org
>>> > http://mitgcm.org/mailman/listinfo/mitgcm-support
>>>
>>>
>>> _______________________________________________
>>> MITgcm-support mailing list
>>> MITgcm-support at mitgcm.org
>>> http://mitgcm.org/mailman/listinfo/mitgcm-support
>>>
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mitgcm.org/pipermail/mitgcm-support/attachments/20121003/dfe10333/attachment.htm>


More information about the MITgcm-support mailing list