DocumentCode
134238
Title
The text analysis and processing of Thai language text to speech conversion system
Author
Xuee Lin ; Jian Yang ; Juan Zhao
Author_Institution
Sch. of Inf., Yunnan Univ., Kunming, China
fYear
2014
fDate
12-14 Sept. 2014
Firstpage
436
Lastpage
436
Abstract
Summary form only given. The text to speech conversion system is divided into front-end and back-end analysis module speech synthesis module, text analysis and processing will directly determine the effect of naturalness of synthesized speech. In the real Thai texts, there are often contained non-Thai standard characters such as numbers, abbreviations, currency symbols, and so on. Text normalization is the process which is switched the non-standard words into a standard word process. Thai text normalization difficulties mainly expressed for two indexes: The first is the non-standard word recognition judgment; the second is the ambiguity of the non-standard word processing carried disambiguation. This paper proposes a rule-based Thai text and context to determine the combination of the words associated with normalization method. The experimental results show that this method can achieve the desired results, the within test set´s accuracy rate is 97.83%, the outside set´s test accuracy rate is 97.61%.
Keywords
speech synthesis; text analysis; Thai language; back-end analysis module; front-end analysis module; normalization method; speech synthesis; speech synthesis module; text analysis; text normalization; text processing; text-to-speech conversion system; word processing; Accuracy; Educational institutions; Indexes; Speech; Speech processing; Standards; Text analysis; disambiguation; text analysis; text normalization; text to speech conversion; the maximum matching algorithm; the nonstandard word;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location
Singapore
Type
conf
DOI
10.1109/ISCSLP.2014.6936630
Filename
6936630
Link To Document