sge_shadowd - Sun Grid Engine shadow master daemon
sge_shadowd is a "light weight" process which can be run on so-called
shadow master hosts in a Sun Grid Engine cluster to detect failure of
the current Sun Grid Engine master daemon, sge_qmaster(8), and to
start-up a new sge_qmaster(8) on the host on which the sge_shadowd
runs. If multiple shadow daemons are active in a cluster, they run a
protocol which ensures that only one of them will start-up a new master
The hosts suitable for being used as shadow master hosts must have
shared root read/write access to the directory
$SGE_ROOT/$SGE_CELL/common as well as to the master daemon spool
directory (by default $SGE_ROOT/$SGE_CELL/spool/qmaster). The names of
the shadow master hosts need to be contained in the file
sge_shadowd may only be started by root.
SGE_ROOT Specifies the location of the Sun Grid Engine standard
SGE_CELL If set, specifies the default Sun Grid Engine cell. To
address a Sun Grid Engine cell sge_shadowd uses (in the
order of precedence):
The name of the cell specified in the environment
variable SGE_CELL, if it is set.
The name of the default cell, i.e. default.
If set, specifies that debug information should be
written to stderr. In addition the level of detail in
which debug information is generated is defined.
If set, specifies the tcp port on which sge_qmaster(8)
is expected to listen for communication requests. Most
installations will use a services map entry for the
service "sge_qmaster" instead to define that port.
SGE_DELAY_TIME This variable controls the interval in which sge_shadowd
pauses if a takeover bid fails. This value is used only
when there are multiple sge_shadowd instances and they
are contending to be the master. The default is 600
This variable controls the interval in which the
sge_shadowd checks the heartbeat file (60 seconds by
This variable controls the interval when a sge_shadowd
instance tries to take over when the heartbeat file has
Default configuration directory
Shadow master hostname file.
Default master daemon spool directory
The heartbeat file.
sge_intro(1), sge_conf(5), sge_qmaster(8), Sun Grid Engine Installation
and Administration Guide.
See sge_intro(1) for a full statement of rights and permissions.