DocumentCode
323560
Title
Improved phone recognition using Bayesian triphone models
Author
Ming, Ji ; Smith, F. Jack
Author_Institution
Sch. of Electr. Eng. & Comput. Sci., Queen´´s Univ., Belfast, UK
Volume
1
fYear
1998
fDate
12-15 May 1998
Firstpage
409
Abstract
A crucial issue in triphone based continuous speech recognition is the large number of models to be estimated against the limited availability of training data. This problem can be relieved by composing a triphone model from less context-dependent models. This paper introduces a new statistical framework, derived from the Bayesian principle, to perform such a composition. The potential power of this new framework is explored, both algorithmically and experimentally, by an implementation with hidden Markov modeling techniques. This implementation is applied to the recognition of the 39-phone set on the TIMIT database. The new model achieves 74.4% and 75.6% accuracy, respectively, on the core and complete test sets
Keywords
Bayes methods; hidden Markov models; speech recognition; Bayesian triphone models; TIMIT database; continuous speech recognition; hidden Markov modeling techniques; less context-dependent models; phone recognition; statistical framework; Availability; Bayesian methods; Computer science; Context modeling; Databases; Hidden Markov models; Interpolation; Lifting equipment; Speech recognition; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.674454
Filename
674454
Link To Document