Title :
Transform-based multi-feature optimization for robust distributed speech recognition
Author :
Addou, D. ; Selouani, S.A. ; Boudraa, M. ; Boudraa, B.
Author_Institution :
Speech & Signal Process. Lab., Univ. of Sci. & Technol., Houari Boumediene (USTHB), Algiers, Algeria
Abstract :
This paper describes a noise-robust Distributed Speech Recognition (DSR) front-end using a combination of conventional Mel-cepstral Coefficient (MFCC) and Line Spectral Frequencies (LSF). These features are adequately transformed and reduced in a multi-stream scheme using Karhunen-Loeve Transform (KLT). We investigate the performance of a new front-end DSR in terms of recognition accuracy in adverse conditions as well as in terms of dimensionality reduction. Our results showed that for highly noisy speech, the proposed transformation scheme leads to a significant improvement in recognition accuracy on Aurora 2 task.
Keywords :
Karhunen-Loeve transforms; cepstral analysis; feature extraction; optimisation; speech recognition; DSR front-end; Karhunen-Loeve transform; LSF; MFCC; Mel-cepstral coefficient; dimensionality reduction; line spectral frequency; multistream scheme; noise-robust distributed speech recognition front-end; transform-based multifeature optimization; Acoustics; Feature extraction; Noise measurement; Robustness; Speech; Speech recognition; Telecommunication standards; Distributed speech recognition; KLT; LSF; MFCC; multi-stream paradigm;
Conference_Titel :
GCC Conference and Exhibition (GCC), 2011 IEEE
Conference_Location :
Dubai
Print_ISBN :
978-1-61284-118-2
DOI :
10.1109/IEEEGCC.2011.5752586