• DocumentCode
    1819324
  • Title

    An HMM-based Cantonese speech synthesis system

  • Author

    Xin Wang ; Zhiyong Wu

  • Author_Institution
    Tsinghua-CUHK Joint Res. Center for Media Sci., Tsinghua Univ., Shenzhen, China
  • fYear
    2012
  • fDate
    18-20 Nov. 2012
  • Firstpage
    141
  • Lastpage
    142
  • Abstract
    This paper describes a Cantonese HMM-based speech synthesis system (HTS) using the general architecture of Crystal - a multilingual text-to-speech (TTS) framework developed in Tsinghua University. The generated synthesis engine of HTS has advantage of small footprint, the size of which is less than 7M bytes, and can be easily ported to embedded electronic devices such as smart-phones, set-top boxes, etc. Furthermore, the quality of the synthetic speech can be easily characterized by modifying the synthetic acoustic parameters of the proposed system. The result shows noticeable improvement in naturalness and smoother transition than the corpus-based unit-selection concatenative speech synthesis approach.
  • Keywords
    hidden Markov models; speech synthesis; Cantonese HMM-based speech synthesis system; HTS; TTS; Tsinghua University; corpus-based unit-selection concatenative speech synthesis approach; crystal architecture; embedded electronic device; multilingual text-to-speech framework; set-top box; smart-phone; synthetic acoustic parameter; Cantonese; HMM model; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Global High Tech Congress on Electronics (GHTCE), 2012 IEEE
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-1-4673-5086-0
  • Type

    conf

  • DOI
    10.1109/GHTCE.2012.6490141
  • Filename
    6490141