[Aces-support] job exit unexpected on geo

aces-admin at techsquare.com aces-admin at techsquare.com
Tue Apr 3 11:23:29 EDT 2007


hello lurr-

i have made some changes to the
exports file at geo that should 
help out here. 

the short story is that the compute-
node you were assigned had so many 
network connexions that it was forced
to use a high-port to request the nfs-
mount of your home directory. by default,
such requests are refused and your job
would fail out straight-away.

i have enabled such requests to be accepted
from the compute-nodes. 

[greg]


> Date: Mon, 02 Apr 2007 20:43:23 -0400
> From: Rongrong Lu <lurr at mit.edu>
> MIME-Version: 1.0
> Cc: 
> Reply-To: ACES-support at mitgcm.org
> 
> I had met the same problem today as what Yang met.
> The job quit right after I submit it, the error message is the following:
> -bash: line 1: /var/torque/mom_priv/jobs/3605.geo.SC: No such file or 
> directory
> The same job was submitted successfully yesterday without any problem.
> 
> 
> Rongrong Lu
> 
> --------------------------------------------
> Earth Resources Laboratory, MIT
> 42 Carleton St. E34-370, Cambridge, MA 02142
> Tel:     617-253-7835 (o)  617-230-6729 (m)
> Email:   lurr at mit.edu
> Web:     http://web.mit.edu/lurr
> --------------------------------------------
> 
> Yang Zhang wrote:
> > Hi,
> > 
> > I have a job exited by no reason just after I submitted on geo.  It 
> > worked well if I ran it in the interactive mode on geo.  The jobid of my 
> > job is: 3604.geo.  Can you help check this for me?  This problem 
> > happened couple of weeks ago, and it seems still there.
> > 
> > Thanks,
> > Yang
> > _______________________________________________
> > Aces-support mailing list
> > Aces-support at acesgrid.org
> > http://acesgrid.org/mailman/listinfo/aces-support
> _______________________________________________
> Aces-support mailing list
> Aces-support at acesgrid.org
> http://acesgrid.org/mailman/listinfo/aces-support
> 



More information about the Aces-support mailing list