DocumentCode :
900248
Title :
Aggregate a posteriori linear regression adaptation
Author :
Chien, Jen-Tzung ; Huang, Chih-Hsien
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Volume :
14
Issue :
3
fYear :
2006
fDate :
5/1/2006 12:00:00 AM
Firstpage :
797
Lastpage :
807
Abstract :
We present a new discriminative linear regression adaptation algorithm for hidden Markov model (HMM) based speech recognition. The cluster-dependent regression matrices are estimated from speaker-specific adaptation data through maximizing the aggregate a posteriori probability, which can be expressed in a form of classification error function adopting the logarithm of posterior distribution as the discriminant function. Accordingly, the aggregate a posteriori linear regression (AAPLR) is developed for discriminative adaptation where the classification errors of adaptation data are minimized. Because the prior distribution of regression matrix is involved, AAPLR is geared with the Bayesian learning capability. We demonstrate that the difference between AAPLR discriminative adaptation and maximum a posteriori linear regression (MAPLR) adaptation is due to the treatment of the evidence. Different from minimum classification error linear regression (MCELR), AAPLR has closed-form solution to fulfil rapid adaptation. Experimental results reveal that AAPLR speaker adaptation does improve speech recognition performance with moderate computational cost compared to maximum likelihood linear regression (MLLR), MAPLR, MCELR and conditional maximum likelihood linear regression (CMLLR). These results are verified for supervised adaptation as well as unsupervised adaptation for different numbers of adaptation data.
Keywords :
belief networks; hidden Markov models; learning (artificial intelligence); matrix algebra; maximum likelihood estimation; regression analysis; speech recognition; Bayesian learning; HMM; aggregate a posteriori linear regression adaptation; cluster-dependent regression matrices; discriminative linear regression adaptation algorithm; hidden Markov model; maximum a posteriori linear regression; speaker-specific adaptation data; speech recognition; unsupervised adaptation; Aggregates; Bayesian methods; Clustering algorithms; Hidden Markov models; Linear regression; Maximum likelihood estimation; Maximum likelihood linear regression; Natural languages; Robustness; Speech recognition; Aggregate a posteriori criterion; Bayesian learning; discriminative adaptation; linear regression adaptation; speaker adaptation; speech recognition;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TSA.2005.860847
Filename :
1621195
Link To Document :
بازگشت