Title :
Robustness of speech recognition using genetic algorithms and a Mel-cepstral subspace approach
Author :
Selouani, S.A. ; O´Shaughnessy, D.
Author_Institution :
Univ. de Moncton, Campus De Shippagan, Canada
Abstract :
The paper presents a method to compensate Mel-frequency cepstral coefficients (MFCCs) for a HMM-based speech recognition system evolving under telephone-channel degradations. The technique we propose is based on the combination of the Karhonen-Loeve transform (KLT) and genetic algorithms (GA). The idea consists of projecting the band-limited MFCCs onto a subspace generated by the genetically optimized KLT principal axes. Experiments show a clear improvement when the method is applied to the NTIMIT telephone speech database. Word recognition results obtained on the HTK toolkit platform using N-mixture triphone models and a bigram language model are presented and discussed.
Keywords :
Karhunen-Loeve transforms; genetic algorithms; hidden Markov models; speech recognition; telephony; HMM; Karhonen-Loeve transform; MFCC; Mel-cepstral subspace approach; Mel-frequency cepstral coefficients; bigram language model; genetic algorithms; speech recognition robustness; telephone-channel degradation; triphone models; word recognition; Acoustic testing; Additive noise; Degradation; Genetic algorithms; Karhunen-Loeve transforms; Noise generators; Robustness; Speech recognition; Telephony; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1325957