DocumentCode
323571
Title
Natural number recognition using MCE trained inter-word context dependent acoustic models
Author
Gandhi, Malan B. ; Jacob, John
Author_Institution
Bell Labs., Lucent Technol., Naperville, IL, USA
Volume
1
fYear
1998
fDate
12-15 May 1998
Firstpage
457
Abstract
Among applications that require number recognition, the focus has largely been on connected digit recognizers. We introduce an acoustic model topology for natural number recognition by using minimum classification error (MCE) training of inter-word context dependent models of the head-body-tail (HBT) type. Experimental results on natural number applications involving dollar amounts and US telephone numbers show that using HBT models for natural number data reduces string error rates by as much as 25% over context independent whole word models. In addition, for speech input which is strictly of connected digit type, the increase in string error rates is negligible when a natural number telephone grammar is used instead of a connected digit telephone grammar. This will enable natural number speech recognition systems to be more widely accepted because the recognition accuracy is maintained while permitting a more natural and flexible user interface
Keywords
acoustic signal processing; context-sensitive grammars; error statistics; pattern classification; speech recognition; MCE trained acoustic models; US telephone numbers; acoustic model topology; connected digit recognizers; connected digit telephone grammar; context independent whole word models; dollar amounts; experimental results; head-body-tail models; inter-word context dependent acoustic models; minimum classification error; minimum classification error training; natural number speech recognition systems; natural number telephone grammar; recognition accuracy; speech input; string error rates reduction; user interface; Acoustic applications; Context modeling; Error analysis; Heterojunction bipolar transistors; Hidden Markov models; Jacobian matrices; Speech recognition; Telephony; Topology; User interfaces;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.674466
Filename
674466
Link To Document