Title :
Cooperative modular neural predictive coding
Author :
Chetouani, M. ; Gas, B. ; Zarader, J.L.
Author_Institution :
Lab. des Instrum. et Syst. d´´Ile-De-France, Paris VI Univ., France
Abstract :
Speech feature extraction is one of the most important stage in the speech recognition process. In this paper, we propose a new neural networks architecture called the cooperative modular neural predictive coding (CMNPC). It is based on the interaction of discriminant experts DFE-NPC (discriminant feature extraction) optimized for macro-classification by the help of a criterion: the modelisation error ratio (MER). We propose a theoretical validation of this model by linking The MER with a likelihood ratio. The performances of this architecture are estimated in a phoneme recognition task. The phonemes are extracted from the Darpa-Timit speech database. Comparisons with coding methods (LPC, MFCC, PLP) are presented. They put in obviousness an improvement of the recognition rates.
Keywords :
audio databases; cepstral analysis; error statistics; feature extraction; linear predictive coding; neural net architecture; speech recognition; Darpa-Timit speech database; Mel frequency cepstral coding; coding methods; cooperative modular neural predictive coding; discriminant feature extraction; linear predictive coding; macroclassification; neural networks architecture; perceptual linear predictive coding; phoneme recognition task; speech feature extraction; speech recognition process; Cepstral analysis; Feature extraction; Independent component analysis; Linear predictive coding; Mel frequency cepstral coefficient; Neural networks; Predictive coding; Predictive models; Speech processing; Speech recognition;
Conference_Titel :
Neural Networks for Signal Processing, 2003. NNSP'03. 2003 IEEE 13th Workshop on
Print_ISBN :
0-7803-8177-7
DOI :
10.1109/NNSP.2003.1318063