• DocumentCode
    1020834
  • Title
    A projection-based likelihood measure for speech recognition in noise
  • Author
    Carlson, Beth A.; Clements, Mark A.
  • Author_Institution
    Dept. of Electr. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
  • Volume
    2
  • Issue
    1
  • fYear
    1994
  • Firstpage
    97
  • Lastpage
    102
  • Abstract
    Investigates a projection-based likelihood measure that significantly improves automatic speech recognition performance in the presence of additive broadband noise. The measure was developed by modifying likelihood scores in continuous Gaussian density hidden Markov models (HMMs), resulting in the weighted projection measure (WPM). Experimental results using the proposed measure are reported for several performance factors: different cepstral-based parameters, normal and multistyle speech, and various noise signals, including white, jittering white, and broadband colored noise. In all cases, significant improvements in speaker-dependent, isolated word recognition were achieved using the WPM instead of the standard Gaussian likelihood measure, the weighted Euclidean distance (WED). As an example, at an SNR of 5 dB, the WPM improved recognition accuracy from 19.4% to 80.6% compared with the standard WED for the DFT mel-cepstral representation.
  • Keywords
    acoustic noise; hidden Markov models; interference suppression; random noise; speech analysis and processing; speech recognition; WED; additive broadband noise; automatic speech recognition performance; broadband colored noise; cepstral-based parameters; continuous Gaussian density hidden Markov models; jittering white noise; multistyle speech; noise signals; normal speech; projection-based likelihood measure; speaker-dependent isolated word recognition; weighted Euclidean distance; weighted projection measure; white noise; Additive noise; Automatic speech recognition; Colored noise; Density measurement; Hidden Markov models; Noise measurement; Speech enhancement; Speech recognition; Weight measurement; White noise
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1063-6676
  • Type
    jour
  • DOI
    10.1109/89.260341
  • Filename
    260341