Title :
A new robust LPCCS for speech recogniton in channel distortion environments
Author :
He, Yongjun ; Han, Jiqing
Author_Institution :
Speech Process. Lab., Harbin Inst. of Technol., Harbin, China
Abstract :
Linear prediction (LP) analysis plays an important role in speech processing. linear prediction cepstral coefficients (LPCCs) are proved to be better features for speech recognition. In LP analysis, the generation of speech is modeled with an autoregressive (AR) random process. In fact, speech from vocal cord goes through vocal tract first, and then goes through channel. However, general LP analysis does not consider the effect of channel and the LPCCs show little robustness in speech recognition under channel distortion. In this paper, distorted speech is modeled by two different AR models which denote vocal tract and channel, respectively. After the poles are computed, K-means is used to cluster all poles and a new method is proposed to choose Channel poles. For each frame of test utterances, instead of designing a filter using channel poles, robust LPCCs are computed directly by removing channel poles from all poles. Experiment shows that this approach can obtain a better performance than general methods.
Keywords :
autoregressive processes; filtering theory; speech recognition; K-means method; autoregressive random process; channel distortion environments; channel poles; filter design; linear prediction analysis; linear prediction cepstral Coefficients; speech processing; speech recognition; Cepstral analysis; Cybernetics; Machine learning; Nonlinear distortion; Predictive models; Robustness; Speech analysis; Speech processing; Speech recognition; Testing; Channel distortion; Feature extraction; Linear prediction; Speech recognition; Tracking;
Conference_Titel :
Machine Learning and Cybernetics, 2009 International Conference on
Conference_Location :
Baoding
Print_ISBN :
978-1-4244-3702-3
Electronic_ISBN :
978-1-4244-3703-0
DOI :
10.1109/ICMLC.2009.5212203