• DocumentCode
    134238
  • Title

    The text analysis and processing of Thai language text to speech conversion system

  • Author

    Xuee Lin ; Jian Yang ; Juan Zhao

  • Author_Institution
    Sch. of Inf., Yunnan Univ., Kunming, China
  • fYear
    2014
  • fDate
    12-14 Sept. 2014
  • Firstpage
    436
  • Lastpage
    436
  • Abstract
    Summary form only given. The text to speech conversion system is divided into front-end and back-end analysis module speech synthesis module, text analysis and processing will directly determine the effect of naturalness of synthesized speech. In the real Thai texts, there are often contained non-Thai standard characters such as numbers, abbreviations, currency symbols, and so on. Text normalization is the process which is switched the non-standard words into a standard word process. Thai text normalization difficulties mainly expressed for two indexes: The first is the non-standard word recognition judgment; the second is the ambiguity of the non-standard word processing carried disambiguation. This paper proposes a rule-based Thai text and context to determine the combination of the words associated with normalization method. The experimental results show that this method can achieve the desired results, the within test set´s accuracy rate is 97.83%, the outside set´s test accuracy rate is 97.61%.
  • Keywords
    speech synthesis; text analysis; Thai language; back-end analysis module; front-end analysis module; normalization method; speech synthesis; speech synthesis module; text analysis; text normalization; text processing; text-to-speech conversion system; word processing; Accuracy; Educational institutions; Indexes; Speech; Speech processing; Standards; Text analysis; disambiguation; text analysis; text normalization; text to speech conversion; the maximum matching algorithm; the nonstandard word;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2014.6936630
  • Filename
    6936630