DocumentCode :
3630887
Title :
Improved language modelling by unsupervised acquisition of structure
Author :
K. Ries; Finn Dag Buo; Ye-Yi Wang
Author_Institution :
Karlsruhe Univ., Germany
Volume :
1
fYear :
1995
Firstpage :
193
Abstract :
The perplexity of corpora is typically reduced by more than 30% compared to advanced n-gram models by a new method for the unsupervised acquisition of structural text models. This method is based on new algorithms for the classification of words and phrases from context and on new sequence finding procedures. These procedures are designed to work fast and accurately on small and large corpora. They are iterated to build a structural model of a corpus. The structural model can be applied to recalculate the scores of a speech recogniser and improves the word accuracy. Further applications such as preprocessing for neural networks and (hidden) Markov models in language processing, which exploit the structure finding capabilities of this model, are proposed.
Keywords :
"Hidden Markov models","Interactive systems","Cities and towns","Laboratories","Speech recognition","Neural networks","Testing","Councils","Lattices","Scheduling"
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479397
Filename :
479397
Link To Document :
بازگشت