• DocumentCode
    3079344
  • Title

    A maximum entropy approach to Chinese grapheme-to-phoneme conversion

  • Author

    Tsai, Richard Tzong-Han ; Wang, Yu-Chun

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Yuan Ze Univ., Chungli, Taiwan
  • fYear
    2009
  • fDate
    10-12 Aug. 2009
  • Firstpage
    411
  • Lastpage
    416
  • Abstract
    Grapheme-to-phoneme (G2P) conversion plays an important role in speech synthesis. The main difficulty facing Chinese G2P conversion is that many Chinese characters are polyphonic, having more than one pronunciation. A Chinese G2P system must be able to pick the correct pronunciation from among several candidates. Contextual information on neighboring characters such as character n-grams, phonetic information, or position of the polyphone in a word or sentence is the key to correct prediction. Most previous works employed rule-based or rule-learning methods, which often suffered from data sparseness. In this paper, we propose a novel G2P approach to avoid data sparseness. Our method uses the maximum entropy (ME) model framework to represent contextual information as ME features. Our system achieves a top accuracy of 99.84%, which is significantly higher than other state-of-the-art rule-based and rule-learning methods. In addition, our approach consistently improves accuracy regardless of a character´s main pronunciation ratio. Further analysis also shows that the ME model is fast and efficient, requiring much less training and labeling time.
  • Keywords
    entropy; knowledge based systems; learning (artificial intelligence); speech synthesis; Chinese grapheme-to-phoneme conversion; Chinese polyphonic character; character n-gram; maximum entropy approach; phonetic information; rule-based learning method; speech synthesis; Computer science; Context modeling; Decision trees; Dictionaries; Entropy; Humans; Labeling; Laboratories; Speech synthesis; Synthesizers; Chinese Grapheme-to-Phoneme Conversion; Maximum Entropy Model; Speech Synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse & Integration, 2009. IRI '09. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • Print_ISBN
    978-1-4244-4114-3
  • Electronic_ISBN
    978-1-4244-4116-7
  • Type

    conf

  • DOI
    10.1109/IRI.2009.5211588
  • Filename
    5211588