• DocumentCode
    290377
  • Title

    Environment normalization for robust speech recognition using direct cepstral comparison

  • Author

    Hua Liu, Fu ; Stern, Richard M. ; Acero, Alejandro ; Moreno, Pedro J.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    ii
  • fYear
    1994
  • fDate
    19-22 Apr 1994
  • Abstract
    In this paper we describe and evaluate a series of new algorithms that compensate for the effects of unknown acoustical environments or changes in environment. The algorithms use compensation vectors that are added to the cepstral representations of speech that is input to a speech recognition system. While these vectors are computed from direct frame-by-frame comparisons of cepstra of speech simultaneously recorded in the training environment and various prototype testing environments, the compensation algorithms do not assume that the acoustical characteristics of the actual testing environment are known. The specific compensation vector applied in a given frame depends on either physical attributes such as SNR or presumed phonetic identity. The compensation algorithms are evaluated using the 1992 ARPA 5000 word WSJ/CSR corpus. The best system combines phoneme-based and SNR-based cepstral compensation with cepstral mean normalization, and provides a 66.8% reduction in error rate over baseline processing when tested using a standard suite of unknown microphones
  • Keywords
    cepstral analysis; error compensation; normalising; speech processing; speech recognition; vectors; ARPA 5000 word WSJ/CSR corpus; SNR; acoustical environments; algorithms; baseline processing; cepstral mean normalization; compensation algorithms; compensation vectors; direct cepstral comparison; direct frame-by-frame comparisons; environment normalization; error rate; phonetic identity; prototype testing environments; robust speech recognition; training environment; Acoustic distortion; Acoustic testing; Cepstral analysis; Maximum likelihood detection; Microphone arrays; Robustness; Speech analysis; Speech recognition; Vectors; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
  • Conference_Location
    Adelaide, SA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-1775-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1994.389718
  • Filename
    389718