DocumentCode
498850
Title
A new robust LPCCS for speech recogniton in channel distortion environments
Author
He, Yongjun ; Han, Jiqing
Author_Institution
Speech Process. Lab., Harbin Inst. of Technol., Harbin, China
Volume
4
fYear
2009
fDate
12-15 July 2009
Firstpage
2358
Lastpage
2362
Abstract
Linear prediction (LP) analysis plays an important role in speech processing. linear prediction cepstral coefficients (LPCCs) are proved to be better features for speech recognition. In LP analysis, the generation of speech is modeled with an autoregressive (AR) random process. In fact, speech from vocal cord goes through vocal tract first, and then goes through channel. However, general LP analysis does not consider the effect of channel and the LPCCs show little robustness in speech recognition under channel distortion. In this paper, distorted speech is modeled by two different AR models which denote vocal tract and channel, respectively. After the poles are computed, K-means is used to cluster all poles and a new method is proposed to choose Channel poles. For each frame of test utterances, instead of designing a filter using channel poles, robust LPCCs are computed directly by removing channel poles from all poles. Experiment shows that this approach can obtain a better performance than general methods.
Keywords
autoregressive processes; filtering theory; speech recognition; K-means method; autoregressive random process; channel distortion environments; channel poles; filter design; linear prediction analysis; linear prediction cepstral Coefficients; speech processing; speech recognition; Cepstral analysis; Cybernetics; Machine learning; Nonlinear distortion; Predictive models; Robustness; Speech analysis; Speech processing; Speech recognition; Testing; Channel distortion; Feature extraction; Linear prediction; Speech recognition; Tracking;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2009 International Conference on
Conference_Location
Baoding
Print_ISBN
978-1-4244-3702-3
Electronic_ISBN
978-1-4244-3703-0
Type
conf
DOI
10.1109/ICMLC.2009.5212203
Filename
5212203
Link To Document