• DocumentCode
    1749716
  • Title

    Automatic generation and selection of multiple pronunciations for dynamic vocabularies

  • Author

    Deligne, Sabine ; Maison, Benoit ; Gopinath, Ramesh

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    1
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    565
  • Abstract
    We present a scheme for the acoustic modeling of speech recognition applications requiring dynamic vocabularies. It applies especially to the acoustic modeling of out-of-vocabulary words which need to be added to a recognition lexicon based on the observation of a few (say one or two) speech utterances of these words. Standard approaches to this problem derive a single pronunciation from each speech utterance by combining acoustic and phone transition scores. In our scheme, multiple pronunciations are generated from each speech utterance of a word to enroll by varying the relative weights assigned to the acoustic and phone transition models. In our experiments, the use of these multiple baseforms dramatically outperforms the standard approach with a relative decrease of the word error rate ranging from 20% to 40% on all our test sets
  • Keywords
    probability; speech processing; speech recognition; acoustic modeling; dynamic vocabularies; multiple pronunciations; out-of-vocabulary words; phone transition models; recognition lexicon speech utterances; speech recognition; Acoustic applications; Engines; Error analysis; Keyboards; Robustness; Speech recognition; Testing; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.940894
  • Filename
    940894