[MITgcm-support] cubeSphereExchange and MPI error?
Anthony Coletti
ajcolett at geo.umass.edu
Tue Jul 8 15:13:08 EDT 2014
Hi Jean-Michel,
I have realized I made the mistake of not specifying -mods=../code in my batch script. That could be why none of my packages installed - so you are correct in mentioning that. Thank you. Unfortunately another error has popped up and the model still crashes after ~6 seconds:
ModuleCmd_Switch.c(172):ERROR:152: Module 'PrgEnv-cray' is currently not loaded
At line 2825 of file ini_parms.f (unit = 11, file = '/lustre/medusa/acoletti/gfortrantm
pLb7lfy')
Fortran runtime error: Cannot match namelist object name .
At line 2825 of file ini_parms.f (unit = 11, file = '/lustre/medusa/acoletti/gfortrantm
p0SUaGP')
Fortran runtime error: Cannot match namelist object name .
At line 2825 of file ini_parms.f (unit = 11, file = '/lustre/medusa/acoletti/gfortrantm
pjDleuD')
Fortran runtime error: Cannot match namelist object name .
At line 2825 of file ini_parms.f (unit = 11, file = '/lustre/medusa/acoletti/gfortrantm
p0mWCfy')
There are 20 (or so) more lines of this error each specifying a different gfortran tmp file.
Here is line 2825 of my ini_parms.f file. I think it is having a problem with PARM03 in the 'data' file? I am not completely familiar with fortran so I am not sure what namelist object it is referring to:
816 C-- Time stepping parameters
2817 rCD = -1.D0
2818 epsAB_CD = UNSET_RL
2819 latBandClimRelax = UNSET_RL
2820 deltaTtracer = 0.D0
2821 forcing_In_AB = .TRUE.
2822 WRITE(msgBuf,'(A)') ' INI_PARMS ; starts to read PARM03'
2823 CALL PRINT_MESSAGE( msgBuf, standardMessageUnit,
2824 & SQUEEZE_RIGHT, myThid )
2825 READ(UNIT=iUnit,NML=PARM03) !,IOSTAT=errIO)
2826 IF ( errIO .LT. 0 ) THEN
2827 WRITE(msgBuf,'(A)')
2828 & 'S/R INI_PARMS: Error reading model parameter file "data"'
2829 CALL PRINT_ERROR( msgBuf, myThid )
2830 WRITE(msgBuf,'(A)') 'S/R INI_PARMS: Problem in namelist PARM03'
2831 CALL PRINT_ERROR( msgBuf, myThid )
2832 STOP 'ABNORMAL END: S/R INI_PARMS'
2833 ELSE
2834 WRITE(msgBuf,'(A)') ' INI_PARMS ; read PARM03 : OK'
2835 CALL PRINT_MESSAGE( msgBuf, standardMessageUnit,
2836 & SQUEEZE_RIGHT, myThid )
2837 ENDIF
Anthony
Anthony J. Coletti
Climate System Research Center
Department of Geosciences
Morrill Building
611 N. Pleasant Street
233 Morrill Science Center
University of Massachusetts-Amherst
paleoclimate.org
Email: ajcolett at geo.umass.edu
http://blogs.umass.edu/ajcolett/
http://necsc.umass.edu/people/anthony-coletti
“For me, I am driven by two main philosophies: know more today about the world than I knew yesterday and lessen the suffering of others. You'd be surprised how far that gets you.” ― Neil deGrasse Tyson
On Jul 8, 2014, at 8:45 AM, Anthony Coletti <ajcolett at geo.umass.edu> wrote:
> Hi Jean-Michel,
>
> Let’s start with the exch2 problem,
>
> So I check the PACKAGES_CONFIG.h and it seems you are right..for some reason, exch2 is undef
>
> Here is the copy of my PACKAGES_CONFIG.h file:
>
> reated by convert_cpp_cmd2defines with the following command line arguments:
> -bPACKAGES_CONFIG_H Disabled packages: -UALLOW_ADMTLM -UALLOW_AIM_V23 -UALLOW_ATM2D -UALLOW_ATM_COMMON -UALLOW_A
> TM_COMPON_INTERF -UALLOW_ATM_OCN_COUPLER -UALLOW_ATM_PHYS -UALLOW_AUTODIFF -UALLOW_BBL -UALLOW_BULK_FORCE -UALLOW
> _CAL -UALLOW_CD_CODE -UALLOW_CFC -UALLOW_CHEAPAML -UALLOW_CHRONOS -UALLOW_COMPON_COMMUNIC -UALLOW_COST -UALLOW_CT
> RL -UALLOW_DIAGNOSTICS -UALLOW_DIC -UALLOW_DOWN_SLOPE -UALLOW_EBM -UALLOW_ECCO -UALLOW_EMBED_FILES -UALLOW_EXCH2
> -UALLOW_EXF -UALLOW_FIZHI -UALLOW_FLT -UALLOW_FRAZIL -UALLOW_GCHEM -UALLOW_GGL90 -UALLOW_GMREDI -UALLOW_GRDCHK -U
> ALLOW_GRIDALT -UALLOW_ICEFRONT -UALLOW_KPP -UALLOW_LAND -UALLOW_LAYERS -UALLOW_LONGSTEP -UALLOW_MATRIX -UALLOW_MN
> C -UALLOW_MY82 -UALLOW_MYPACKAGE -UALLOW_OBCS -UALLOW_OCN_COMPON_INTERF -UALLOW_OFFLINE -UALLOW_OPENAD -UALLOW_OP
> PS -UALLOW_PP81 -UALLOW_PROFILES -UALLOW_PTRACERS -UALLOW_RBCS -UALLOW_REGRID -UALLOW_RUNCLOCK -UALLOW_SALT_PLUME
> -UALLOW_SBO -UALLOW_SEAICE -UALLOW_SHAP_FILT -UALLOW_SHELFICE -UALLOW_SHOWFLOPS -UALLOW_SMOOTH -UALLOW_SPHERE -U
> ALLOW_STREAMICE -UALLOW_THSICE -UALLOW_TIMEAVE -UALLOW_ZONAL_FILT Enabled packages: -DALLOW_DEBUG -DALLOW_GENER
> IC_ADVDIFF -DALLOW_MDSIO -DALLOW_MOM_COMMON -DALLOW_MOM_FLUXFORM -DALLOW_MOM_VECINV -DALLOW_MONITOR -DALLOW_RW
> */
>
> #ifndef PACKAGES_CONFIG_H
> #define PACKAGES_CONFIG_H
> /* Disabled packages: */
> #undef ALLOW_ADMTLM
> #undef ALLOW_AIM_V23
> #undef ALLOW_ATM2D
> #undef ALLOW_ATM_COMMON
> #undef ALLOW_ATM_COMPON_INTERF
> #undef ALLOW_ATM_OCN_COUPLER
> #undef ALLOW_ATM_PHYS
> #undef ALLOW_AUTODIFF
> #undef ALLOW_BBL
> #undef ALLOW_BULK_FORCE
> #undef ALLOW_CAL
> #undef ALLOW_CD_CODE
> #undef ALLOW_CFC
> #undef ALLOW_CHEAPAML
> #undef ALLOW_CHRONOS
> #undef ALLOW_COMPON_COMMUNIC
> #undef ALLOW_COST
> #undef ALLOW_CTRL
> #undef ALLOW_DIAGNOSTICS
> #undef ALLOW_DIC
> #undef ALLOW_DOWN_SLOPE
> #undef ALLOW_EBM
> #undef ALLOW_ECCO
> #undef ALLOW_EMBED_FILES
> #undef ALLOW_EXCH2
> #undef ALLOW_EXF
> #undef ALLOW_FIZHI
> #undef ALLOW_FLT
> #undef ALLOW_FRAZIL
> #undef ALLOW_GCHEM
> #undef ALLOW_GGL90
> #undef ALLOW_GMREDI
> #undef ALLOW_GRDCHK
> #undef ALLOW_GRIDALT
> #undef ALLOW_ICEFRONT
> #undef ALLOW_KPP
> #undef ALLOW_LAND
> #undef ALLOW_LAYERS
> #undef ALLOW_LONGSTEP
> #undef ALLOW_MATRIX
> #undef ALLOW_MNC
> #undef ALLOW_MY82
> #undef ALLOW_MYPACKAGE
> #undef ALLOW_OBCS
> #undef ALLOW_OCN_COMPON_INTERF
> #undef ALLOW_OFFLINE
> #undef ALLOW_OPENAD
> #undef ALLOW_OPPS
> #undef ALLOW_PP81
> #undef ALLOW_PROFILES
> #undef ALLOW_PTRACERS
> #undef ALLOW_RBCS
> #undef ALLOW_REGRID
> #undef ALLOW_RUNCLOCK
> #undef ALLOW_SALT_PLUME
> #undef ALLOW_SBO
> #undef ALLOW_SEAICE
> #undef ALLOW_SHAP_FILT
> #undef ALLOW_SHELFICE
> #undef ALLOW_SHOWFLOPS
> #undef ALLOW_SMOOTH
> #undef ALLOW_SPHERE
> #undef ALLOW_STREAMICE
> #undef ALLOW_THSICE
> #undef ALLOW_TIMEAVE
> #undef ALLOW_ZONAL_FILT
> /* */
> /* Enabled packages: */
> #define ALLOW_DEBUG
> #define ALLOW_GENERIC_ADVDIFF
> #define ALLOW_MDSIO
> #define ALLOW_MOM_COMMON
> #define ALLOW_MOM_FLUXFORM
> #define ALLOW_MOM_VECINV
> #define ALLOW_MONITOR
> #define ALLOW_RW
> #endif /* PACKAGES_CONFIG_H */
>
>
> I wonder why that would be considering I have the packages to install listed in my packages.config file.
>
> This is the command I use for compiling the GCM which seems correct:
> ../../../tools/genmake2 -optfile=../../../tools/build_options/linux_amd64_gfortran -mpi
>
>
> And here is some lines from genmake2:
>
> === Processing options files and arguments ===
> getting local config information: none found
> Warning: ROOTDIR was not specified ; try using a local copy of MITgcm found at "../../.."
> getting OPTFILE information:
> using OPTFILE="../../../tools/build_options/linux_amd64_gfortran"
> getting AD_OPTFILE information:
> using AD_OPTFILE="../../../tools/adjoint_options/adjoint_default"
> check makedepend (local: 0, system: 0, 0)
> Turning on MPI cpp macros
>
> === Checking system libraries ===
> Do we have the system() command using gfortran... yes
> Do we have the fdate() command using gfortran... yes
> Do we have the etime() command using gfortran... no
> Can we call simple C routines (here, "cloc()") using gfortran... yes
> Can we unlimit the stack size using gfortran... yes
> Can we register a signal handler using gfortran... yes
> Can we use stat() through C calls... yes
> Can we create NetCDF-enabled binaries... yes
> Can we create LAPACK-enabled binaries... no
> Can we call FLUSH intrinsic subroutine... yes
>
> === Setting defaults ===
> Adding MODS directories:
> Making source files in eesupp from templates
> Making source files in pkg/exch2 from templates
> Making source files in pkg/regrid from templates
>
> === Determining package settings ===
> getting package dependency info from ../../../pkg/pkg_depend
> getting package groups info from ../../../pkg/pkg_groups
> checking list of packages to compile:
> before group expansion packages are: default_pkg_list
> replacing "default_pkg_list" with: gfd
> replacing "gfd" with: mom_common mom_fluxform mom_vecinv generic_advdiff debug mdsio rw monitor
> after group expansion packages are: mom_common mom_fluxform mom_vecinv generic_advdiff debug mdsio rw monitor
> applying DISABLE settings
> applying ENABLE settings
> packages are: debug generic_advdiff mdsio mom_common mom_fluxform mom_vecinv monitor rw
> applying package dependency rules
> packages are: debug generic_advdiff mdsio mom_common mom_fluxform mom_vecinv monitor rw
> Adding STANDARDDIRS='eesupp model'
> Searching for *OPTIONS.h files in order to warn about the presence
> of "#define "-type statements that are no longer allowed:
> found CPP_EEOPTIONS="../../../eesupp/inc/CPP_EEOPTIONS.h"
> found CPP_OPTIONS="../../../model/inc/CPP_OPTIONS.h"
> Creating the list of files for the adjoint compiler.
>
> === Creating the Makefile ===
> setting INCLUDES
> Determining the list of source and include files
> Writing makefile: Makefile
> Add the source list for AD code generation
> Making list of "exceptions" that need ".p" files
> Making list of NOOPTFILES
> Add rules for links
> Adding makedepend marker
>
> FYI: I am building a simulation using the 32x6x32x15 MPI run.
>
> The makefile looks okay…it seems to be making the dependencies. I do have 2 or 3 sets of this warning:
>
> gfortran -fconvert=big-endian -fimplicit-none -mcmodel=medium -O0 -funroll-loops -c solve_tridiagonal.f
> cat solve_uv_tridiago.F | cpp -traditional -P -DWORDLENGTH=4 -DNML_TERMINATOR -DALLOW_USE_MPI -DALWAYS_USE_MPI -DALLOW_USE_MPI -DHAVE_SYSTEM -DHAVE_FDATE -DHAVE_CLOC -DHAVE_SETRLSTK -DHAVE_SIGREG -DHAVE_STAT -DHAVE_NETCDF -DHAVE_FLUSH -I/opt/cray/netcdf-hdf5parallel/4.3.1/GNU/48/include -I/opt/cray/mpt/6.3.0/gni/mpich2-gnu/48/include | ../../../tools/set64bitConst.sh > solve_uv_tridiago.f
> gfortran -fconvert=big-endian -fimplicit-none -mcmodel=medium -O0 -funroll-loops -c solve_uv_tridiago.f
> solve_uv_tridiago.f:1076.72:
>
> DO bj=2,nSy
> 1
> Warning: DO loop at (1) will be executed zero times
> solve_uv_tridiago.f:1103.24:
>
> DO bj=nSy-1,1,-1
> 1
> Warning: DO loop at (1) will be executed zero times
>
>
> I am not sure if that means anything.
>
>
> Attached is my ini_threading_environment.f (small f)
> <ini_threading_environment.f>
>
>
> Thanks for your fruitful insight!
> Anthony
>
>
> Anthony J. Coletti
> Climate System Research Center
> Department of Geosciences
> Morrill Building
> 611 N. Pleasant Street
> 233 Morrill Science Center
> University of Massachusetts-Amherst
> paleoclimate.org
> Email: ajcolett at geo.umass.edu
> http://blogs.umass.edu/ajcolett/
> http://necsc.umass.edu/people/anthony-coletti
>
> “For me, I am driven by two main philosophies: know more today about the world than I knew yesterday and lessen the suffering of others. You'd be surprised how far that gets you.” ― Neil deGrasse Tyson
>
>
>
>
> On Jul 7, 2014, at 11:30 PM, Jean-Michel Campin <jmc at ocean.mit.edu> wrote:
>
>> PACKAGES_CONFIG.h
>
> _______________________________________________
> MITgcm-support mailing list
> MITgcm-support at mitgcm.org
> http://mitgcm.org/mailman/listinfo/mitgcm-support
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mitgcm.org/pipermail/mitgcm-support/attachments/20140708/ba88bf7e/attachment-0001.htm>
More information about the MITgcm-support
mailing list