DocumentCode
352357
Title
PCA-PMC: a novel use of a priori knowledge for fast parallel model combination
Author
Sarikaya, Ruhi ; Hansen, John H L
Author_Institution
Robust Speech Process. Lab., Colorado Univ., Boulder, CO, USA
Volume
2
fYear
2000
fDate
2000
Abstract
This paper describes an algorithm to reduce computational complexity of the parallel model combination (PMC) method for robust speech recognition while retaining the same level of performance. Although, PMC is effective in composing a noise corrupted acoustic model from clean speech and noise models, the intense computational complexity limits its use in real-time use. The novel approach here is to encode the clean models using principal component analysis (PCA) and pre-compute the prototype vectors and matrices for the means and covariances in the linear spectral-domain using rectangular DCT and inverse DCT matrices. Therefore, transformation into the linear spectral domain is reduced to finding the projection of each vector in the eigen space of means and covariances followed by a linear combination of vectors and matrices obtained from the projections. Furthermore, the eigen space allows a better trade-off for reducing computational complexity versus accuracy. The computational savings are demonstrated both analytically and through experimental evaluations. Experiments using context independent phone recognition with TIMIT data shows that the new PMC framework can outperforms the baseline method by a factor of 1.9 with the same level of accuracy
Keywords
computational complexity; discrete cosine transforms; eigenvalues and eigenfunctions; inverse problems; matrix algebra; principal component analysis; spectral-domain analysis; speech coding; speech recognition; PCA-PMC; a priori knowledge; clean speech; computational complexity; covariances; eigenspace; fast parallel model combination; inverse DCT matrices; linear spectral domain; linear spectral-domain; matrices; mean; noise corrupted acoustic model; noise models; performance; phone recognition; principal component analysis; projection; prototype vectors; protype vectors; rectangular DCT matrices; robust speech recognition; transformation; Acoustic noise; Computational complexity; Computational modeling; Covariance matrix; Discrete cosine transforms; Noise robustness; Principal component analysis; Speech enhancement; Speech recognition; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1520-6149
Print_ISBN
0-7803-6293-4
Type
conf
DOI
10.1109/ICASSP.2000.859159
Filename
859159
Link To Document