Title :
Improved language modelling by unsupervised acquisition of structure
Author :
K. Ries; Finn Dag Buo; Ye-Yi Wang
Author_Institution :
Karlsruhe Univ., Germany
Abstract :
The perplexity of corpora is typically reduced by more than 30% compared to advanced n-gram models by a new method for the unsupervised acquisition of structural text models. This method is based on new algorithms for the classification of words and phrases from context and on new sequence finding procedures. These procedures are designed to work fast and accurately on small and large corpora. They are iterated to build a structural model of a corpus. The structural model can be applied to recalculate the scores of a speech recogniser and improves the word accuracy. Further applications such as preprocessing for neural networks and (hidden) Markov models in language processing, which exploit the structure finding capabilities of this model, are proposed.
Keywords :
"Hidden Markov models","Interactive systems","Cities and towns","Laboratories","Speech recognition","Neural networks","Testing","Councils","Lattices","Scheduling"
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479397