NAME
vilistextum - html to ascii converter
SYNOPSIS
vilistextum [OPTIONS] [inputfile |-] [outputfile | -]
DESCRIPTION
vilistextum is a html to ascii converter specifically programmed to get
the best out of incorrect html.
OPTIONS
inputfile,- resp. outputfile,-
replace inputfile with ’-’ for reading from standard input,
likewise outputfile with ’-’ for writing to standard output.
-a, --no-alt
don’t output anything for IMG tags even if they have an ALT
attribute. Implies --no-image.
-c, --convert-tags
some tags will be converted to special characters.
-e, --errorlevel NUMBER
increase level of verbosity for error messages (0: No error
messages).
-i, --defimage STRING
IMG tags without alt attribute are output as [STRING].
-l, --links
numbers the links in the document and creates footnotes of each
link at the end of the file.
-k, --links-inline
print the links directly after the html tag.
-m, --dont-convert-characters
don’t convert the entities from windows1252 (€-Ÿ and
their proper entity names)
-n, --no-image
don’t output [Image] for IMG tags that have no ALT attribute.
-p, --palm
output text more suitable for reading on a PDA.
-r, --remove-empty-alt
if there is an empty ALT attribute in a IMG tag (eg <IMG
href="..." alt="">), don’t output ’[]’.
-s, --shrink-lines [NUMBER]
if there are more than NUMBER empty lines, output only NUMBER.
Default: 1.
-t, --no-title
don’t output title.
-w, --width NUMBER
maximum line width.
-h, --help
display this help and exit
-v, --version
output version information and exit
MULTIBYTE OPTIONS (Only available if compiled with multibyte support)
-u, --output-utf-8
instead of the character set of the html document, everything
will be output as utf-8.
-x, --translit
use the //TRANSLIT feature of libiconv. Consult the iconv manual
for details.
-y, --charset CHARSET
if the HTML document doesn’t provide a character set in the meta
tags, use CHARSET.
LIMITATIONS
The rendering of tables is not very good.
The handling of OL is incomplete. The program treats it as UL and more
than 10 nested lists confuse it.
Text is never justified.
REPORTING BUGS
Please report bugs to <bhaak@gmx.net>.
AUTHOR
Vilistextum was written by Patric Mueller <bhaak@gmx.net> and may be
freely distributed under the terms of the GNU General Public License
Version 2. There is ABSOLUTELY NO WARRANTY for this program.
SEE ALSO
iconv(3), lynx(1), links(1), w3m(1)
22 OCT 2006