Title :
Incorporating voice onset time to improve letter recognition accuracies
Author :
Niyogi, Partha ; Ramesh, Padma
Author_Institution :
Lucent Technol., AT&T Bell Labs., Murray Hill, NJ, USA
Abstract :
We consider the possibility of incorporating distinctive features into a statistically based speech recognizer. We develop a two pass strategy for recognition with a standard HMM based first pass followed by a second pass that performs an alternative analysis to extract class-specific features. For the voiced/voiceless distinction on stops for an alphabet recognition task, we show that a linguistically motivated acoustic feature exists (the VOT), provides superior separability to standard spectral measures, and can be automatically extracted from the signal to reduce error rates by 48.7% over state of the art HMM systems
Keywords :
error correction; feature extraction; hidden Markov models; linguistics; speech recognition; statistical analysis; HMM systems; alphabet recognition; class-specific feature extraction; distinctive features; error correcting device; error rate reduction; letter recognition accuracies; linguistically motivated acoustic feature; second pass; spectral measures; standard HMM based first pass; statistically based speech recognizer; stops; two pass strategy; voice onset time; voiced/voiceless distinction; Acoustic measurements; Automatic speech recognition; Error analysis; Error correction; Feature extraction; Hidden Markov models; Measurement standards; Performance analysis; Speech recognition; Standards development;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.674355