A Toolkit for Statistical Language Modeling, Text Retrieval, Classification and Clustering (1996)