DocumentCode :
1467931
Title :
Transformation-based Bayesian predictive classification using online prior evolution
Author :
Chien, Jen-Tzung ; Liao, Guo-Hong
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Volume :
9
Issue :
4
fYear :
2001
fDate :
5/1/2001 12:00:00 AM
Firstpage :
399
Lastpage :
410
Abstract :
The mismatch between training and testing environments makes the necessity of speech recognizers to be adaptive both in acoustic modeling and decision making. Accordingly, the speech hidden Markov models (HMMs) should be able to incrementally capture the evolving statistics of environments using online available data. Also, it is necessary for speech recognizers to exploit the robust decision strategy, which takes the uncertainty of parameters into account. This paper presents a transformation-based Bayesian predictive classification (TBPC) where the uncertainty of the transformation parameters of the HMM mean vector and precision matrix is adequately represented by a joint multivariate prior density of normal-Wishart belonging to the conjugate family. The formulation of TBPC decision is correspondingly constructed. Due to the benefit of conjugate density, we generate the reproducible prior/posterior pair such that the hyperparameters of prior density could evolve successively to new environments using online test/adaptation data. The evolved hyperparameters could suitably describe the parameter uncertainty for TBPC decision. Therefore, a novel framework of TBPC geared with online prior evolution (OPE) capability is developed for robust speech recognition. This framework is examined to be effective as well as efficient on the recognition task of connected Chinese digits in hands-free car environments
Keywords :
hidden Markov models; prediction theory; signal classification; speech recognition; HMM; HMM mean vector; acoustic modeling; conjugate density; conjugate family; connected Chinese digits recognition; decision making; hands-free car environments; hyperparameters; joint multivariate prior density; normal-Wishart; online available data; online prior evolution; online test/adaptation data; parameter uncertainty; parameters uncertainty; precision matrix; reproducible prior/posterior pair; robust decision; robust speech recognition; speech hidden Markov models; speech recognizers; testing environment; training environment; transformation parameters; transformation-based Bayesian predictive classification; Acoustic testing; Bayesian methods; Decision making; Hidden Markov models; Maximum likelihood estimation; Maximum likelihood linear regression; Robustness; Speech recognition; Statistics; Uncertainty;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.917685
Filename :
917685
Link To Document :
بازگشت