This is a implementation of the TextCat algorithm, the text categorisation algorithm based on n-gram frequency.
The code does not include the ngram files which you will have to download from the original textcat tarball.
The n-gram classification code can be downloaded here
Comments !