DocumentCode
699445
Title
Learning Vector Quantization and Neural Predictive Coding for nonlinear speech feature extraction
Author
Chetouani, M. ; Gas, B. ; Zarader, J.L.
Author_Institution
Lab. des Instrum. et Syst. d´Ile-De-France, Univ. Paris VI, Paris, France
fYear
2004
fDate
6-10 Sept. 2004
Firstpage
2059
Lastpage
2062
Abstract
Speech recognition is a special field of pattern recognition. In order to improve the performances of the systems, one can opt for several ways and among them the design of a feature extractor. This paper presents a new nonlinear feature extraction method based on the Learning Vector Quantization (LVQ) and the Neural Predictive Coding (NPC). The key idea of this work is to design a feature extractor, the NPC, by the introduction of discriminant constraint provided by the LVQ classifier. The performances are estimated on a phoneme classification task by several methods: GMM, MLP, LVQ. The phonemes are extracted from the NTIMIT database. We make comparisons with linear and nonlinear feature transformation methods (LDA, PCA, NLDA, NPCA), and also with coding methods (LPC, MFCC, PLP).
Keywords
Gaussian processes; feature extraction; mixture models; signal classification; speech coding; vector quantisation; GMM; Gaussians mixture model; LVQ classifier; NTIMIT database; learning vector quantization; neural predictive coding; nonlinear feature transformation method; nonlinear speech feature extraction method; phoneme classification task; Abstracts; Adaptation models; Databases; Loss measurement; Mel frequency cepstral coefficient; Predictive models; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2004 12th European
Conference_Location
Vienna
Print_ISBN
978-320-0001-65-7
Type
conf
Filename
7079975
Link To Document