NAME
qhost - show the status of Sun Grid Engine hosts, queues, jobs
SYNTAX
qhost [ -cb ] [ -F [resource_name,...] ] [ -help ] [ -h host_list ] [
-j ] [ -l resource=val,... ] [ -u user,... ] [ -xml ].
DESCRIPTION
qhost shows the current status of the available Sun Grid Engine hosts,
queues and the jobs associated with the queues. Selection options allow
you to get information about specific hosts, queues, jobs or users. If
multiple selections are done a host is only displayed if all selection
criteria for a host are met. Without any options qhost will display a
list of all hosts without queue or job information.
OPTIONS
-F [ resource_name,... ]
qhost will present a detailed listing of the current resource
availability per host with respect to all resources (if the
option argument is omitted) or with respect to those resources
contained in the resource_name list. Please refer to the
description of the Full Format in section OUTPUT FORMATS below
for further detail.
-cb This command line switch can be used since Sun Grid Engine
version 6.2u5 in combination with other qhost(1) command line
switches. In that case the output of the corresponding command
will contain information concerning the added job to core
binding feature.
If this switch is not used then the command behaves as in
version 6.2u4 and previous versions.
If this option is used then two additional columns will be shown
for each displayed host in the output. The first is named NSOC
and represents the number of available sockets on that host. The
second additional column is named NCOR and it represents the
number of cores that are available per socket on the
corresponding machine.
-help Prints a listing of all options.
-h host_list
Prints a list of all hosts contained in host_list.
-j Prints all jobs running on the queues hosted by the shown hosts.
This switch calls -q implicitly.
-l resource[=value],...
Defines the resources to be granted by the hosts which should be
included in the host list output. Matching is performed on hosts
based on non-mutable resource availability information only.
That means load values are always ignored except the so-called
static load values (i.e. "arch", "num_proc", "mem_total",
"swap_total" and "virtual_total") ones. Also consumable
utilization is ignored. If there are multiple -l resource
requests they will be concatenated by a logical AND: a host
needs to match all resources to be displayed.
-q Show information about the queues instances hosted by the
displayed hosts.
-u user,...
Display information only on those jobs and queues being
associated with the users from the given user list.
-xml This option can be used with all other options and changes the
output to XML. The used schemas are referenced in the XML
output. The output is printed to stdout.
If the -xml parameter is combined with -cb then the XML output
will contain additional tags containing information about job to
core binding. You can find schema files with the suffix "_cb"
in the directory $SGE_ROOT/util/resources/schemas/qhost that
describe that changes.
OUTPUT FORMATS
Depending on the presence or absence of the -q or -F and -j option
three output formats need to be differentiated. PP
Default Format (without -q, -F and -j)
For each host one line is printed. The output consists of consisting of
o the Hostname
o the Architecture.
o the Number of processors.
o the Load.
o the Total Memory.
o the Used Memory.
o the Total Swapspace.
o the Used Swapspace.
If the -q option is supplied, each host status line also contains extra
lines for every queue hosted by the host consisting of,
o the queue name.
o the queue type - one of B(atch), I(nteractive), C(heckpointing),
P(arallel), T(ransfer) or combinations thereof,
o the number of reserved, used and available job slots,
o the state of the queue - one of u(nknown) if the corresponding
sge_execd(8) cannot be contacted, a(larm), A(larm), C(alendar
suspended), s(uspended), S(ubordinate), d(isabled), D(isabled),
E(rror) or combinations thereof.
If the state is a(alarm) at least one of the load thresholds defined in
the load_thresholds list of the queue configuration (see queue_conf(5))
is currently exceeded, which prevents from scheduling further jobs to
that queue.
As opposed to this, the state A(larm) indicates that at least one of
the suspend thresholds of the queue (see queue_conf(5)) is currently
exceeded. This will result in jobs running in that queue being
successively suspended until no threshold is violated.
The states s(uspended) and d(isabled) can be assigned to queues and
released via the qmod(1) command. Suspending a queue will cause all
jobs executing in that queue to be suspended.
The states D(isabled) and C(alendar suspended) indicate that the queue
has been disabled or suspended automatically via the calendar facility
of Sun Grid Engine (see calendar_conf(5)), while the S(ubordinate)
state indicates, that the queue has been suspend via subordination to
another queue (see queue_conf(5) for details). When suspending a queue
(regardless of the cause) all jobs executing in that queue are
suspended too.
If an E(rror) state is displayed for a queue, sge_execd(8) on that host
was unable to locate the sge_shepherd(8) executable on that host in
order to start a job. Please check the error logfile of that
sge_execd(8) for leads on how to resolve the problem. Please enable the
queue afterwards via the -c option of the qmod(1) command manually.
If the -F option was used, resource availability information is printed
following the host status line. For each resource (as selected in an
option argument to -F or for all resources if the option argument was
omitted) a single line is displayed with the following format:
o a one letter specifier indicating whether the current resource
availability value was dominated by either
`g' - a cluster global,
`h' - a host total or
o a second one letter specifier indicating the source for the current
resource availability value, being one of
`l' - a load value reported for the resource,
`L' - a load value for the resource after administrator defined load
scaling has been applied,
`c' - availability derived from the consumable resources facility
(see complexes(5)),
`f' - a fixed availability definition derived from a non-consumable
complex attribute or a fixed resource limit.
o after a colon the name of the resource on which information is
displayed.
o after an equal sign the current resource availability value.
The displayed availability values and the sources from which they
derive are always the minimum values of all possible combinations.
Hence, for example, a line of the form "qf:h_vmem=4G" indicates that a
queue currently has a maximum availability in virtual memory of 4
Gigabyte, where this value is a fixed value (e.g. a resource limit in
the queue configuration) and it is queue dominated, i.e. the host in
total may have more virtual memory available than this, but the queue
doesn't allow for more. Contrarily a line "hl:h_vmem=4G" would also
indicate an upper bound of 4 Gigabyte virtual memory availability, but
the limit would be derived from a load value currently reported for the
host. So while the queue might allow for jobs with higher virtual
memory requirements, the host on which this particular queue resides
currently only has 4 Gigabyte available.
After the queue status line (in case of -j) a single line is printed
for each job running currently in this queue. Each job status line
contains
o the job ID,
o the job name,
o the job owner name,
o the status of the job - one of t(ransfering), r(unning),
R(estarted), s(uspended), S(uspended) or T(hreshold) (see the
Reduced Format section for detailed information),
o the start date and time and the function of the job (MASTER or SLAVE
- only meaningful in case of a parallel job) and
o the priority of the jobs.
ENVIRONMENTAL VARIABLES
SGE_ROOT Specifies the location of the Sun Grid Engine standard
configuration files.
SGE_CELL If set, specifies the default Sun Grid Engine cell. To
address a Sun Grid Engine cell qhost uses (in the order
of precedence):
The name of the cell specified in the environment
variable SGE_CELL, if it is set.
The name of the default cell, i.e. default.
SGE_DEBUG_LEVEL
If set, specifies that debug information should be
written to stderr. In addition the level of detail in
which debug information is generated is defined.
SGE_QMASTER_PORT
If set, specifies the tcp port on which sge_qmaster(8)
is expected to listen for communication requests. Most
installations will use a services map entry for the
service "sge_qmaster" instead to define that port.
FILES
<sge_root>/<cell>/common/act_qmaster
Sun Grid Engine master host file
SEE ALSO
sge_intro(1), qalter(1), qconf(1), qhold(1), qmod(1), qstat(1),
qsub(1), queue_conf(5), sge_execd(8), sge_qmaster(8), sge_shepherd(8).
COPYRIGHT
See sge_intro(1) for a full statement of rights and permissions.