DocumentCode :
1146756
Title :
Linear regression based Bayesian predictive classification for speech recognition
Author :
Chien, Jen-Tzung
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Volume :
11
Issue :
1
fYear :
2003
fDate :
1/1/2003 12:00:00 AM
Firstpage :
70
Lastpage :
79
Abstract :
The uncertainty in parameter estimation due to the adverse environments deteriorates the classification performance for speech recognition. It becomes crucial to incorporate the parameter uncertainty into decision so that the classification robustness can be assured. We propose a novel linear regression based Bayesian predictive classification (LRBPC) for robust speech recognition. This framework is constructed under the paradigm of linear regression adaptation of speech hidden Markov models (HMMs). Because the regression mapping between HMMs and adaptation data is ill posed, we properly characterize the uncertainty of regression parameters using a joint Gaussian distribution . A closed-form predictive distribution can be derived to set up the LRBPC decision for speech recognition. Such decision is robust compared to the plug-in maximum a posteriori (MAP) decision adopted in the maximum likelihood linear regression (MLLR) and MAP linear regression (MAPLR). Since the specified distribution belongs to the conjugate prior family, the evolutionary hyperparameters are established. With the statistically rich hyperparameters, the LRBPC achieves decision robustness. In the experiments, we find that LRBPC decision in cases of general linear regression as well as single variable linear regression attains significantly better recognition performance than MLLR and MAPLR adaptation.
Keywords :
Bayes methods; Gaussian distribution; hidden Markov models; prediction theory; signal classification; speech recognition; statistical analysis; Bayesian predictive classification; HMM; MAP linear regression; classification performance; classification robustness; closed-form predictive distribution; conjugate prior family; evolutionary hyperparameters; hidden Markov models; ill posed adaptation data; joint Gaussian distribution; linear regression; linear regression speech adaptation; maximum a posteriori decision; maximum likelihood linear regression; parameter estimation uncertainty; recognition performance; regression parameters; speech recognition; Bayesian methods; Gaussian distribution; Hidden Markov models; Linear regression; Maximum likelihood linear regression; Parameter estimation; Robustness; Speech recognition; Uncertain systems; Uncertainty;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2002.805640
Filename :
1179381
Link To Document :
بازگشت