• DocumentCode
    1652112
  • Title

    An HMM-based Vietnamese speech synthesis system

  • Author

    Vu, Thang Tat ; Luong, Mai Chi ; Nakamura, Satoshi

  • Author_Institution
    NICT - Nat. Inst. of Inf. & Commun. Technol., Japan
  • fYear
    2009
  • Firstpage
    116
  • Lastpage
    121
  • Abstract
    This paper describes an approach to the realization of a Vietnamese speech synthesis system applying a technique whereby speech is directly synthesized from Hidden Markov models (HMMs). Spectrum, pitch, and phone duration are simultaneously modeled in HMMs and their parameter distributions are clustered independently by using decision tree-based context clustering algorithms. Several contextual factors such as tone types, syllables, words, phrases, and utterances were determined and are taken into account to generate the spectrum, pitch, and state duration. The resulting system yields significant correctness for a tonal language, and a fair reproduction of the prosody.
  • Keywords
    decision trees; hidden Markov models; natural language processing; pattern clustering; speech synthesis; statistical distributions; HMM-based Vietnamese speech synthesis system; decision tree-based context clustering algorithm; hidden Markov model; parameter distribution; phone duration; pitch model; spectrum model; syllables; tonal language; tone type; utterances; Clustering algorithms; Communications technology; Context modeling; Databases; Decision trees; Frequency; Hidden Markov models; Natural languages; Speech synthesis; HMM-based; Vietnamese synthesis; tone;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speech Database and Assessments, 2009 Oriental COCOSDA International Conference on
  • Conference_Location
    Urumqi
  • Print_ISBN
    978-1-4244-4400-7
  • Electronic_ISBN
    978-1-4244-4400-7
  • Type

    conf

  • DOI
    10.1109/ICSDA.2009.5278366
  • Filename
    5278366