DocumentCode :
483496
Title :
Corpus-based Malay text-to-speech synthesis system
Author :
Swee, T.T. ; Salleh, S.H.S.
Author_Institution :
Fac. of Biomed. & Health Sci. Eng., Univ. Teknol. Malaysia, Skudai
fYear :
2008
fDate :
14-16 Oct. 2008
Firstpage :
1
Lastpage :
5
Abstract :
The main problem with current Malay text-to-speech (TTS) synthesis system is the poor quality of the generated speech sound. This poor quality is resulted from the inability of traditional TTS system to provide multiple choices of unit for generating more accurate synthesized speech. Most of the current available Malay TTS systems are utilizing diphone concatenation that only support a single unit for each existing diphone, thus it cannot provide more accurate selection of speech unit for concatenation. This project has implemented a variable length unit selection Malay text to speech system that is capable of providing more natural and accurate unit selection for synthesized speech. This paper proposes a method of combining both linguistic context and feature distance cost for selecting the best match unit. A set of digitized Malay word has been collected from Malay Internet news for Malay word frequency count. 381 sentences have been designed which cover around 70 percent of high frequency words from 10 million digitized word obtained from Malay Internet news. Then a unit selection method has been implemented to provide the capability of selecting a speech unit not only limited to phoneme, diphone or triphone but also a string of phonemes that can be matched directly to the database. A set of listening test namely modify rhythm test (MRT) has been carried out with 35 participants, which represented 86 percent of accuracy.
Keywords :
natural languages; speech synthesis; Internet news; Malay word frequency count; corpus-based Malay; digitized Malay word; diphone concatenation; feature distance cost; linguistic context; modify rhythm test; phoneme string; poor quality; text-to-speech synthesis system; variable length unit selection; Biomedical engineering; Databases; Frequency; Internet; Search engines; Smoothing methods; Speech processing; Speech synthesis; Testing; Turing machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications, 2008. APCC 2008. 14th Asia-Pacific Conference on
Conference_Location :
Tokyo
Print_ISBN :
978-4-88552-232-1
Electronic_ISBN :
978-4-88552-231-4
Type :
conf
Filename :
4773661
Link To Document :
بازگشت