مرکز منطقه ای اطلاع رساني علوم و فناوري - PCA-PMC: a novel use of a priori knowledge for fast parallel model combination

DocumentCode :

352357

Title :

PCA-PMC: a novel use of a priori knowledge for fast parallel model combination

Author :

Sarikaya, Ruhi ; Hansen, John H L

Author_Institution :

Robust Speech Process. Lab., Colorado Univ., Boulder, CO, USA

Volume :

fYear :

2000

fDate :

2000

Abstract :

This paper describes an algorithm to reduce computational complexity of the parallel model combination (PMC) method for robust speech recognition while retaining the same level of performance. Although, PMC is effective in composing a noise corrupted acoustic model from clean speech and noise models, the intense computational complexity limits its use in real-time use. The novel approach here is to encode the clean models using principal component analysis (PCA) and pre-compute the prototype vectors and matrices for the means and covariances in the linear spectral-domain using rectangular DCT and inverse DCT matrices. Therefore, transformation into the linear spectral domain is reduced to finding the projection of each vector in the eigen space of means and covariances followed by a linear combination of vectors and matrices obtained from the projections. Furthermore, the eigen space allows a better trade-off for reducing computational complexity versus accuracy. The computational savings are demonstrated both analytically and through experimental evaluations. Experiments using context independent phone recognition with TIMIT data shows that the new PMC framework can outperforms the baseline method by a factor of 1.9 with the same level of accuracy

Keywords :

computational complexity; discrete cosine transforms; eigenvalues and eigenfunctions; inverse problems; matrix algebra; principal component analysis; spectral-domain analysis; speech coding; speech recognition; PCA-PMC; a priori knowledge; clean speech; computational complexity; covariances; eigenspace; fast parallel model combination; inverse DCT matrices; linear spectral domain; linear spectral-domain; matrices; mean; noise corrupted acoustic model; noise models; performance; phone recognition; principal component analysis; projection; prototype vectors; protype vectors; rectangular DCT matrices; robust speech recognition; transformation; Acoustic noise; Computational complexity; Computational modeling; Covariance matrix; Discrete cosine transforms; Noise robustness; Principal component analysis; Speech enhancement; Speech recognition; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on

Conference_Location :

Istanbul

ISSN :

1520-6149

Print_ISBN :

0-7803-6293-4

Type :

conf

DOI :

10.1109/ICASSP.2000.859159

Filename :

859159

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=352357