Man Linux: Main Page and Category List

NAME

       hxextract - extract selected elements from a HTML or XML file

SYNOPSIS

       hxextract  [  -h  |  -?  ]  [  -x ] [ -s text ] [ -e text ] [ -b base ]
       element-or-class [ -c configfile | file-or-URL ]

DESCRIPTION

       hxextract outputs all elements with a certain name and/or class.

       Input must be well-formed, since no HTML heuristics are applied.

OPTIONS

       The following options are supported:

       -x        Use XML format conventions.

       -s text   Insert text at the start of the output.

       -e text   Insert text at the end of the output.

       -b base   URL base

       -c configfile
                 Read @chapter lines from configfile (lines  must  be  of  the
                 form  "@chapter  filename") and extract elements from each of
                 those files.

       -h, -?    Print command usage.

OPERANDS

       The following operands are supported:

       element-or-class
                 The name of an element to extract (e.g., "H2"), or  the  name
                 of   a   class  preceded  by  "."  (e.g.,  ".example")  or  a
                 combination of both (e.g., "H2.example").

       file-or-URL
                 A file name or a URL. To read from standard input, use "-".

SEE ALSO

       hxselect(1)