Man Linux: Main Page and Category List

NAME

       qhost - show the status of Sun Grid Engine hosts, queues, jobs

SYNTAX

       qhost  [ -cb ] [ -F [resource_name,...]  ] [ -help ] [ -h host_list ] [
       -j ] [ -l resource=val,...  ] [ -u user,...  ] [ -xml ].

DESCRIPTION

       qhost shows the current status of the available Sun Grid Engine  hosts,
       queues and the jobs associated with the queues. Selection options allow
       you to get information about specific hosts, queues, jobs or users.  If
       multiple  selections are done a host is only displayed if all selection
       criteria for a host are met.  Without any options qhost will display  a
       list of all hosts without queue or job information.

OPTIONS

       -F [ resource_name,... ]
              qhost  will  present  a detailed listing of the current resource
              availability per host with respect  to  all  resources  (if  the
              option  argument  is omitted) or with respect to those resources
              contained  in  the  resource_name  list.  Please  refer  to  the
              description  of  the Full Format in section OUTPUT FORMATS below
              for further detail.

       -cb    This command line switch can  be  used  since  Sun  Grid  Engine
              version  6.2u5  in  combination with other qhost(1) command line
              switches. In that case the output of the  corresponding  command
              will  contain  information  concerning  the  added  job  to core
              binding feature.

              If this switch is not  used  then  the  command  behaves  as  in
              version 6.2u4 and previous versions.

              If this option is used then two additional columns will be shown
              for each displayed host in the output. The first is  named  NSOC
              and represents the number of available sockets on that host. The
              second additional column is named NCOR  and  it  represents  the
              number   of   cores   that  are  available  per  socket  on  the
              corresponding machine.

       -help  Prints a listing of all options.

       -h host_list
              Prints a list of all hosts contained in host_list.

       -j     Prints all jobs running on the queues hosted by the shown hosts.
              This switch calls -q implicitly.

       -l resource[=value],...
              Defines the resources to be granted by the hosts which should be
              included in the host list output. Matching is performed on hosts
              based  on  non-mutable  resource  availability information only.
              That means load values are always ignored except  the  so-called
              static   load  values  (i.e.  "arch",  "num_proc",  "mem_total",
              "swap_total"  and   "virtual_total")   ones.   Also   consumable
              utilization  is  ignored.   If  there  are  multiple -l resource
              requests they will be concatenated by  a  logical  AND:  a  host
              needs to match all resources to be displayed.

       -q     Show  information  about  the  queues  instances  hosted  by the
              displayed hosts.

       -u user,...
              Display  information  only  on  those  jobs  and  queues   being
              associated with the users from the given user list.

       -xml   This  option  can be used with all other options and changes the
              output to XML. The  used  schemas  are  referenced  in  the  XML
              output. The output is printed to stdout.

              If  the  -xml parameter is combined with -cb then the XML output
              will contain additional tags containing information about job to
              core  binding.   You can find schema files with the suffix "_cb"
              in  the  directory  $SGE_ROOT/util/resources/schemas/qhost  that
              describe that changes.

OUTPUT FORMATS

       Depending  on  the  presence  or  absence of the -q or -F and -j option
       three output formats need to be differentiated.  PP

   Default Format (without -q, -F and -j)
       For each host one line is printed. The output consists of consisting of

       o  the Hostname

       o  the Architecture.

       o  the  Number of processors.

       o  the Load.

       o  the Total Memory.

       o  the Used Memory.

       o  the Total Swapspace.

       o  the Used Swapspace.

       If the -q option is supplied, each host status line also contains extra
       lines for every queue hosted by the host consisting of,

       o  the queue name.

       o  the queue type - one  of  B(atch),  I(nteractive),  C(heckpointing),
          P(arallel), T(ransfer) or combinations thereof,

       o  the number of reserved, used and available job slots,

       o  the  state  of  the  queue  -  one of u(nknown) if the corresponding
          sge_execd(8)  cannot  be  contacted,  a(larm),  A(larm),   C(alendar
          suspended),   s(uspended),  S(ubordinate),  d(isabled),  D(isabled),
          E(rror) or combinations thereof.

       If the state is a(alarm) at least one of the load thresholds defined in
       the load_thresholds list of the queue configuration (see queue_conf(5))
       is currently exceeded, which prevents from scheduling further  jobs  to
       that queue.

       As  opposed  to  this, the state A(larm) indicates that at least one of
       the suspend thresholds of the queue (see  queue_conf(5))  is  currently
       exceeded.  This  will  result  in  jobs  running  in  that  queue being
       successively suspended until no threshold is violated.

       The states s(uspended) and d(isabled) can be  assigned  to  queues  and
       released  via  the  qmod(1)  command. Suspending a queue will cause all
       jobs executing in that queue to be suspended.

       The states D(isabled) and C(alendar suspended) indicate that the  queue
       has  been disabled or suspended automatically via the calendar facility
       of Sun Grid Engine  (see  calendar_conf(5)),  while  the  S(ubordinate)
       state  indicates,  that the queue has been suspend via subordination to
       another queue (see queue_conf(5) for details). When suspending a  queue
       (regardless  of  the  cause)  all  jobs  executing  in  that  queue are
       suspended too.

       If an E(rror) state is displayed for a queue, sge_execd(8) on that host
       was  unable  to  locate  the sge_shepherd(8) executable on that host in
       order  to  start  a  job.  Please  check  the  error  logfile  of  that
       sge_execd(8) for leads on how to resolve the problem. Please enable the
       queue afterwards via the -c option of the qmod(1) command manually.

       If the -F option was used, resource availability information is printed
       following  the  host  status line. For each resource (as selected in an
       option argument to -F or for all resources if the option  argument  was
       omitted) a single line is displayed with the following format:

       o  a  one  letter  specifier  indicating  whether  the current resource
          availability value was dominated by either
          `g' - a cluster global,
          `h' - a host total or

       o  a second one letter specifier indicating the source for the  current
          resource availability value, being one of
          `l' - a load value reported for the resource,
          `L' - a load value for the resource after administrator defined load
          scaling has been applied,
          `c' - availability derived from the  consumable  resources  facility
          (see complexes(5)),
          `f'  - a fixed availability definition derived from a non-consumable
          complex attribute or a fixed resource limit.

       o  after a colon the name of  the  resource  on  which  information  is
          displayed.

       o  after an equal sign the current resource availability value.

       The  displayed  availability  values  and  the  sources from which they
       derive are always the minimum  values  of  all  possible  combinations.
       Hence,  for example, a line of the form "qf:h_vmem=4G" indicates that a
       queue currently has a maximum  availability  in  virtual  memory  of  4
       Gigabyte,  where  this value is a fixed value (e.g. a resource limit in
       the queue configuration) and it is queue dominated, i.e.  the  host  in
       total  may  have more virtual memory available than this, but the queue
       doesn't allow for more. Contrarily a  line  "hl:h_vmem=4G"  would  also
       indicate  an upper bound of 4 Gigabyte virtual memory availability, but
       the limit would be derived from a load value currently reported for the
       host.  So  while  the  queue  might  allow for jobs with higher virtual
       memory requirements, the host on which this  particular  queue  resides
       currently only has 4 Gigabyte available.

       After  the  queue  status line (in case of -j) a single line is printed
       for each job running currently in this  queue.  Each  job  status  line
       contains

       o  the job ID,

       o  the job name,

       o  the job owner name,

       o  the   status   of   the  job  -  one  of  t(ransfering),  r(unning),
          R(estarted),  s(uspended),  S(uspended)  or  T(hreshold)  (see   the
          Reduced Format section for detailed information),

       o  the start date and time and the function of the job (MASTER or SLAVE
          - only meaningful in case of a parallel job) and

       o  the priority of the jobs.

ENVIRONMENTAL VARIABLES

       SGE_ROOT       Specifies the location of the Sun Grid  Engine  standard
                      configuration files.

       SGE_CELL       If  set,  specifies the default Sun Grid Engine cell. To
                      address a Sun Grid Engine cell qhost uses (in the  order
                      of precedence):

                             The name of the cell specified in the environment
                             variable SGE_CELL, if it is set.

                             The name of the default cell, i.e. default.

       SGE_DEBUG_LEVEL
                      If set,  specifies  that  debug  information  should  be
                      written  to  stderr.  In addition the level of detail in
                      which debug information is generated is defined.

       SGE_QMASTER_PORT
                      If set, specifies the tcp port on  which  sge_qmaster(8)
                      is  expected to listen for communication requests.  Most
                      installations will use a  services  map  entry  for  the
                      service "sge_qmaster" instead to define that port.

FILES

       <sge_root>/<cell>/common/act_qmaster
                       Sun Grid Engine master host file

SEE ALSO

       sge_intro(1),   qalter(1),   qconf(1),   qhold(1),  qmod(1),  qstat(1),
       qsub(1), queue_conf(5), sge_execd(8), sge_qmaster(8),  sge_shepherd(8).

COPYRIGHT

       See sge_intro(1) for a full statement of rights and permissions.