Title :
Experiments with linear and nonlinear feature transformations in HMM based phone recognition
Abstract :
Feature extraction is the key element when aiming at robust speech recognition. Both linear and nonlinear data-driven feature transformations are applied to the logarithmic mel-spectral context feature vectors in the TIMIT phone recognition task. Transformations are based on principal component analysis (PCA), independent component analysis (ICA), linear discriminant analysis (LDA) and multilayer perceptron network based nonlinear discriminant analysis (NLDA). All four methods outperform the baseline system which consists of the standard feature representation based on MFCCs (mel-frequency cepstral coefficients) with the first-order deltas, using a mixture-of-Gaussians HMM recognizer. Further improvement is gained by forming the feature vector as a concatenation of the outputs of all four feature transformations.
Keywords :
feature extraction; hidden Markov models; independent component analysis; multilayer perceptrons; principal component analysis; speech recognition; MFCC; TIMIT phone recognition task; feature extraction; independent component analysis; linear discriminant analysis; linear feature transformations; logarithmic mel-spectral context feature vectors; mixture-of-Gaussians HMM recognizer; multilayer perceptron network; nonlinear discriminant analysis; nonlinear feature transformations; principal component analysis; robust speech recognition; Cepstral analysis; Feature extraction; Hidden Markov models; Independent component analysis; Linear discriminant analysis; Multilayer perceptrons; Principal component analysis; Robustness; Speech recognition; Vectors;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198714