• DocumentCode
    2703320
  • Title

    Inequality Maximum Entropy Classifier with Character Features for Polyphone Disambiguation in Mandarin TTS Systems

  • Author

    Xinnian Mao ; Yuan Dong ; Jinyu Han ; Dezhi Huang ; Haila Wang

  • Author_Institution
    France Telecom R&D Center, Beijing, China
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    Grapheme-to-phoneme (G2P) conversion is an important component in TTS systems. The difficulty in Chinese G2P conversion is to disambiguate the polyphones. In this paper, we formulate the polyphone disambiguation problem into a classification problem and propose a language independent classifier based on maximum entropy to address the issue. Furthermore, we introduce inequality smoothing to alleviate data sparseness and exploit language independent character features as linguistic knowledge. Experimental results show that the character features perform as well as the language dependent features such as words and part-of-speech, compared with the widely-used Gaussian smoothing, the inequality smoothing can greatly reduce the active features used in the classifier and achieve better performance. Our classifier achieves 96.35% in term of overall accuracy, greatly superior to 81.22% by using high-frequent "pin-yin"(Romanization of Chinese phoneme). Finally, we explore to merge all key polyphones into 6 groups and find that the overall accuracy only decreases about 2% and the active features are reduced more than 33% further.
  • Keywords
    feature extraction; linguistics; maximum entropy methods; smoothing methods; speech processing; speech synthesis; Chinese phoneme; Gaussian smoothing; Mandarin TTS systems; Romanization; character features; grapheme-to-phoneme conversion; inequality maximum entropy classifier; inequality smoothing; language independent character features; language independent classifier; polyphone disambiguation; Acoustic transducers; Entropy; Hidden Markov models; Natural languages; Research and development; Smoothing methods; Speech synthesis; Sun; Tagging; Telecommunications; Character Features; Grapheme-to-phoneme conversion; Inequality Smoothing; Maximum Entropy; Polyphone;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.367010
  • Filename
    4218198