• DocumentCode
    483496
  • Title

    Corpus-based Malay text-to-speech synthesis system

  • Author

    Swee, T.T. ; Salleh, S.H.S.

  • Author_Institution
    Fac. of Biomed. & Health Sci. Eng., Univ. Teknol. Malaysia, Skudai
  • fYear
    2008
  • fDate
    14-16 Oct. 2008
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    The main problem with current Malay text-to-speech (TTS) synthesis systems is the poor quality of the generated speech. This poor quality results from the inability of traditional TTS systems to offer multiple candidate units when generating synthesized speech. Most currently available Malay TTS systems use diphone concatenation, which supports only a single unit for each existing diphone and therefore cannot select the speech unit that best fits the context. This project implements a variable-length unit selection Malay text-to-speech system that is capable of more natural and accurate unit selection for synthesized speech. This paper proposes a method that combines linguistic context and feature distance cost to select the best-matching unit. A set of digitized Malay words was collected from Malay Internet news to obtain Malay word frequency counts. A total of 381 sentences were designed, covering around 70 percent of the high-frequency words among 10 million digitized words obtained from Malay Internet news. A unit selection method was then implemented that can select speech units not limited to phonemes, diphones, or triphones, but also strings of phonemes that can be matched directly against the database. A listening test, namely the modify rhythm test (MRT), was carried out with 35 participants and yielded 86 percent accuracy.
  • Keywords
    natural languages; speech synthesis; Internet news; Malay word frequency count; corpus-based Malay; digitized Malay word; diphone concatenation; feature distance cost; linguistic context; modify rhythm test; phoneme string; poor quality; text-to-speech synthesis system; variable length unit selection; Biomedical engineering; Databases; Frequency; Internet; Search engines; Smoothing methods; Speech processing; Speech synthesis; Testing; Turing machines;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Communications, 2008. APCC 2008. 14th Asia-Pacific Conference on
  • Conference_Location
    Tokyo
  • Print_ISBN
    978-4-88552-232-1
  • Electronic_ISBN
    978-4-88552-231-4
  • Type

    conf

  • Filename
    4773661
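
  • Illustration
    The abstract describes selecting variable-length units (strings of phonemes matched directly against the database) and ranking candidates by a combination of linguistic context and feature distance cost. The sketch below is only a minimal illustration of that idea, not the authors' implementation. Everything in it is an assumption made for the example: the `Unit` fields, the use of the left-neighbour phoneme as the "linguistic context", the use of a single pitch value as the "feature", the equal weights, and the greedy longest-match search (the record does not say which search strategy the paper uses).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Unit:
    """One recorded speech unit (hypothetical structure for this sketch)."""
    phonemes: tuple      # phoneme string covered by the unit
    left_context: str    # phoneme before the unit in the recording ("#" = silence)
    pitch: float         # toy acoustic feature, in Hz


def context_cost(unit, prev_phoneme):
    # Assumed linguistic-context cost: 0 if the recorded left neighbour
    # matches the left neighbour in the target, 1 otherwise.
    return 0.0 if unit.left_context == prev_phoneme else 1.0


def feature_cost(unit, prev_unit):
    # Assumed feature-distance cost: pitch jump at the join, scaled by 100 Hz.
    if prev_unit is None:
        return 0.0
    return abs(unit.pitch - prev_unit.pitch) / 100.0


def select_units(target, database, w_context=1.0, w_feature=1.0):
    """Greedy longest-match selection over a list of target phonemes."""
    index = {}
    for unit in database:
        index.setdefault(unit.phonemes, []).append(unit)
    max_len = max(len(key) for key in index)

    chosen, pos = [], 0
    while pos < len(target):
        prev_phoneme = target[pos - 1] if pos else "#"
        prev_unit = chosen[-1] if chosen else None
        # Try the longest phoneme string first, then fall back to shorter ones.
        for length in range(min(max_len, len(target) - pos), 0, -1):
            candidates = index.get(tuple(target[pos:pos + length]))
            if candidates:
                best = min(
                    candidates,
                    key=lambda u: w_context * context_cost(u, prev_phoneme)
                    + w_feature * feature_cost(u, prev_unit),
                )
                chosen.append(best)
                pos += length
                break
        else:
            raise KeyError(f"no unit covers phoneme {target[pos]!r}")
    return chosen


# Toy database with two candidates per phoneme string (invented values).
DATABASE = [
    Unit(("s", "a"), "#", 120.0),
    Unit(("s", "a"), "a", 180.0),
    Unit(("y", "a"), "a", 125.0),
    Unit(("y", "a"), "#", 200.0),
    Unit(("s",), "#", 110.0),
]

selected = select_units(list("saya"), DATABASE)
```

    Having several candidates per phoneme string is what distinguishes this from single-unit diphone concatenation: for the target "saya", the sketch prefers the candidates whose recorded context matches and whose pitch is closest at the join. A greedy pass is used here only for brevity; a dynamic-programming search over all candidates would be the usual alternative.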