DocumentCode :
352357
Title :
PCA-PMC: a novel use of a priori knowledge for fast parallel model combination
Author :
Sarikaya, Ruhi ; Hansen, John H L
Author_Institution :
Robust Speech Process. Lab., Colorado Univ., Boulder, CO, USA
Volume :
2
fYear :
2000
fDate :
2000
Abstract :
This paper describes an algorithm to reduce computational complexity of the parallel model combination (PMC) method for robust speech recognition while retaining the same level of performance. Although, PMC is effective in composing a noise corrupted acoustic model from clean speech and noise models, the intense computational complexity limits its use in real-time use. The novel approach here is to encode the clean models using principal component analysis (PCA) and pre-compute the prototype vectors and matrices for the means and covariances in the linear spectral-domain using rectangular DCT and inverse DCT matrices. Therefore, transformation into the linear spectral domain is reduced to finding the projection of each vector in the eigen space of means and covariances followed by a linear combination of vectors and matrices obtained from the projections. Furthermore, the eigen space allows a better trade-off for reducing computational complexity versus accuracy. The computational savings are demonstrated both analytically and through experimental evaluations. Experiments using context independent phone recognition with TIMIT data shows that the new PMC framework can outperforms the baseline method by a factor of 1.9 with the same level of accuracy
Keywords :
computational complexity; discrete cosine transforms; eigenvalues and eigenfunctions; inverse problems; matrix algebra; principal component analysis; spectral-domain analysis; speech coding; speech recognition; PCA-PMC; a priori knowledge; clean speech; computational complexity; covariances; eigenspace; fast parallel model combination; inverse DCT matrices; linear spectral domain; linear spectral-domain; matrices; mean; noise corrupted acoustic model; noise models; performance; phone recognition; principal component analysis; projection; prototype vectors; protype vectors; rectangular DCT matrices; robust speech recognition; transformation; Acoustic noise; Computational complexity; Computational modeling; Covariance matrix; Discrete cosine transforms; Noise robustness; Principal component analysis; Speech enhancement; Speech recognition; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.859159
Filename :
859159
Link To Document :
بازگشت