---------------------------------------------
What is this?
---------------------------------------------
wsj5k.DMP is a 5,000-word trigram language model suitable for use with
Sphinx-3 and Sphinx-4.  The file is in the CMU binary (DMP) format. It
contains:

    4,988 unigrams
1,529,984 bigrams
7,851,482 trigrams

This language model was created using the CMU-Cambridge Statistical Language
Modeling toolkit and data from the Linguistic Data Consortium (LDC). This is
a closed-vocabulary model: words outside the vocabulary are not modeled.
Good-Turing discounting was applied to the n-gram counts.
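
For readers unfamiliar with Good-Turing discounting, the following is a
minimal Python sketch of the textbook formula r* = (r + 1) * N[r+1] / N[r],
where N[r] is the number of distinct n-grams seen exactly r times. It is an
illustration only and is not the toolkit's actual implementation. The
function name, the max_r cutoff, and the toy counts are hypothetical and were
not used to build wsj5k.DMP.

```python
from collections import Counter


def good_turing_adjusted_counts(ngram_counts, max_r=7):
    """Map each raw count r in 1..max_r to its discounted count r*.

    Uses the basic Good-Turing estimate r* = (r + 1) * N[r+1] / N[r].
    max_r is a hypothetical cutoff chosen for this sketch.
    """
    # N[r]: how many distinct n-grams occur exactly r times.
    freq_of_freq = Counter(ngram_counts.values())
    adjusted = {}
    for r in range(1, max_r + 1):
        n_r = freq_of_freq.get(r, 0)
        n_r1 = freq_of_freq.get(r + 1, 0)
        if n_r == 0 or n_r1 == 0:
            # The estimate is undefined here, so leave the count as-is.
            adjusted[r] = float(r)
        else:
            adjusted[r] = (r + 1) * n_r1 / n_r
    return adjusted


# Toy bigram counts (made up): four singletons, two doubletons, one tripleton.
toy_counts = {
    ("the", "market"): 3,
    ("stock", "prices"): 2,
    ("interest", "rates"): 2,
    ("wall", "street"): 1,
    ("prices", "fell"): 1,
    ("rates", "rose"): 1,
    ("the", "dow"): 1,
}

adjusted = good_turing_adjusted_counts(toy_counts, max_r=3)
print(adjusted)  # {1: 1.0, 2: 1.5, 3: 3.0}
```

In the toy data, a bigram seen twice is treated as if it had been seen 1.5
times. The probability mass freed this way is what a backoff model
redistributes to n-grams that were never observed.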
