• DocumentCode
    352357
  • Title

    PCA-PMC: a novel use of a priori knowledge for fast parallel model combination

  • Author

    Sarikaya, Ruhi ; Hansen, John H L

  • Author_Institution
    Robust Speech Process. Lab., Colorado Univ., Boulder, CO, USA
  • Volume
    2
  • fYear
    2000
  • fDate
    2000
  • Abstract
    This paper describes an algorithm to reduce computational complexity of the parallel model combination (PMC) method for robust speech recognition while retaining the same level of performance. Although, PMC is effective in composing a noise corrupted acoustic model from clean speech and noise models, the intense computational complexity limits its use in real-time use. The novel approach here is to encode the clean models using principal component analysis (PCA) and pre-compute the prototype vectors and matrices for the means and covariances in the linear spectral-domain using rectangular DCT and inverse DCT matrices. Therefore, transformation into the linear spectral domain is reduced to finding the projection of each vector in the eigen space of means and covariances followed by a linear combination of vectors and matrices obtained from the projections. Furthermore, the eigen space allows a better trade-off for reducing computational complexity versus accuracy. The computational savings are demonstrated both analytically and through experimental evaluations. Experiments using context independent phone recognition with TIMIT data shows that the new PMC framework can outperforms the baseline method by a factor of 1.9 with the same level of accuracy
  • Keywords
    computational complexity; discrete cosine transforms; eigenvalues and eigenfunctions; inverse problems; matrix algebra; principal component analysis; spectral-domain analysis; speech coding; speech recognition; PCA-PMC; a priori knowledge; clean speech; computational complexity; covariances; eigenspace; fast parallel model combination; inverse DCT matrices; linear spectral domain; linear spectral-domain; matrices; mean; noise corrupted acoustic model; noise models; performance; phone recognition; principal component analysis; projection; prototype vectors; protype vectors; rectangular DCT matrices; robust speech recognition; transformation; Acoustic noise; Computational complexity; Computational modeling; Covariance matrix; Discrete cosine transforms; Noise robustness; Principal component analysis; Speech enhancement; Speech recognition; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.859159
  • Filename
    859159