DocumentCode
1020834
Title
A projection-based likelihood measure for speech recognition in noise
Author
Carlson, Beth A. ; Clements, Mark A.
Author_Institution
Dept. of Electr. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Volume
2
Issue
1
fYear
1994
Firstpage
97
Lastpage
102
Abstract
Investigates a projection-based likelihood measure that significantly improves automatic speech recognition performance in the presence of additive broadband noise. The measure was developed by modifying likelihood scores in continuous Gaussian density hidden Markov models (HMMs), resulting in the weighted projection measure (WPM). Experimental results using the proposed measure are reported for several performance factors: different cepstral-based parameters, normal and multistyle speech, and various noise signals, including white, jittering white, and broadband colored noise. In all cases, significant improvements in speaker-dependent, isolated word recognition were achieved using the WPM instead of the standard Gaussian likelihood measure (weighted Euclidean distance (WED)). As an example, at a SNR of 5 dB, the WPM resulted in improvement in recognition accuracy from 19.4 to 80.6% compared with the standard WED for the DFT mel-cepstral representation.
Keywords
acoustic noise; hidden Markov models; interference suppression; random noise; speech analysis and processing; speech recognition; WED; additive broadband noise; automatic speech recognition performance; broadband colored noise; cepstral-based parameters; continuous Gaussian density hidden Markov models; jittering white noise; multistyle speech; noise signals; normal speech; projection-based likelihood measure; speaker-dependent isolated word recognition; speech recognition; weighted Euclidean distance; weighted projection measure; white noise; Additive noise; Automatic speech recognition; Colored noise; Density measurement; Hidden Markov models; Noise measurement; Speech enhancement; Speech recognition; Weight measurement; White noise;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/89.260341
Filename
260341
Link To Document