genpyt - generate the PINYIN lexicon
genpyt lexicon-file result-file log-file slm-file
genpyt is used to generate the PINYIN lexicon. It only works on
Specify a dictionary file. It should be a line-based text file in
utf-8 encoding . Each line looks like:
CCC id [pinyin'pinyin'pinyin]*
A default dictionary file can be found at
The output binary PINYIN lexicon file. This lexicon contains a trie
presenting the key tree of PINYIN. And all of the candiate words
are sorted using the unigram in slm-file. This file can be used
with sunpinyin input method engines.
Specify the file to where the log goes. The log-file can be seen as
the human-readble presentation of the binary output file.
The language model from which the unigram information are
retrieved. Typically, the slm-file is generated by slmthread.
Originally written by Phill.Zhang <firstname.lastname@example.org>. Currently
maintained by Kov.Chai <email@example.com>.