• DocumentCode
    323571
  • Title

    Natural number recognition using MCE trained inter-word context dependent acoustic models

  • Author

    Gandhi, Malan B. ; Jacob, John

  • Author_Institution
    Bell Labs., Lucent Technol., Naperville, IL, USA
  • Volume
    1
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    457
  • Abstract
    Among applications that require number recognition, the focus has largely been on connected digit recognizers. We introduce an acoustic model topology for natural number recognition by using minimum classification error (MCE) training of inter-word context dependent models of the head-body-tail (HBT) type. Experimental results on natural number applications involving dollar amounts and US telephone numbers show that using HBT models for natural number data reduces string error rates by as much as 25% over context independent whole word models. In addition, for speech input which is strictly of connected digit type, the increase in string error rates is negligible when a natural number telephone grammar is used instead of a connected digit telephone grammar. This will enable natural number speech recognition systems to be more widely accepted because the recognition accuracy is maintained while permitting a more natural and flexible user interface
  • Keywords
    acoustic signal processing; context-sensitive grammars; error statistics; pattern classification; speech recognition; MCE trained acoustic models; US telephone numbers; acoustic model topology; connected digit recognizers; connected digit telephone grammar; context independent whole word models; dollar amounts; experimental results; head-body-tail models; inter-word context dependent acoustic models; minimum classification error; minimum classification error training; natural number speech recognition systems; natural number telephone grammar; recognition accuracy; speech input; string error rates reduction; user interface; Acoustic applications; Context modeling; Error analysis; Heterojunction bipolar transistors; Hidden Markov models; Jacobian matrices; Speech recognition; Telephony; Topology; User interfaces;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.674466
  • Filename
    674466