• DocumentCode
    3425244
  • Title

    Discriminative training for improving letter-to-sound conversion performance

  • Author

    Chen, Yi-Ning ; Liu, Peng ; You, Jia-Li ; Soong, Frank K.

  • Author_Institution
    Microsoft Res. Asia, Beijing
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4649
  • Lastpage
    4652
  • Abstract
    In this paper, we propose to use discriminative training (DT) for improving letter-to-sound (LTS) conversion performance. LTS is a critical component in both ASR and TTS for predicting the correct pronunciation of a word not included in the lexicon. For TTS applications, predicting the proper pronunciation of an out-of-vocabulary person/place name, especially a name with foreign origin can be challenging. We utilize discriminative training, which has been successfully used in speech recognition, to sharpen the baseline N-grams of grapheme-phoneme pairs. We address the problem in a unified framework of discriminative training. Two criteria, maximum mutual information (MMI) and minimum phoneme error (MPE), are investigated. Experimental results show that DT yields a small (3.8-4.6% relative) but consistent error reduction across all databases tested. In addition, we observe that by pinpointing the local errors in a finer resolution, we can obtain a better discriminative model.
  • Keywords
    speech recognition; speech synthesis; correct pronunciation prediction; discriminative training; grapheme-phoneme pairs; letter-to-sound conversion performance; maximum mutual information; minimum phoneme error; speech recognition; Asia; Automatic speech recognition; Databases; Decision trees; Decoding; Mutual information; Natural languages; Speech recognition; Speech synthesis; Statistical analysis; Discriminative Training; Graphoneme; Letter-to-Sound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518693
  • Filename
    4518693