Title :
An HMM-based Cantonese speech synthesis system
Author :
Xin Wang ; Zhiyong Wu
Author_Institution :
Tsinghua-CUHK Joint Res. Center for Media Sci., Tsinghua Univ., Shenzhen, China
Abstract :
This paper describes a Cantonese HMM-based speech synthesis system (HTS) using the general architecture of Crystal - a multilingual text-to-speech (TTS) framework developed in Tsinghua University. The generated synthesis engine of HTS has advantage of small footprint, the size of which is less than 7M bytes, and can be easily ported to embedded electronic devices such as smart-phones, set-top boxes, etc. Furthermore, the quality of the synthetic speech can be easily characterized by modifying the synthetic acoustic parameters of the proposed system. The result shows noticeable improvement in naturalness and smoother transition than the corpus-based unit-selection concatenative speech synthesis approach.
Keywords :
hidden Markov models; speech synthesis; Cantonese HMM-based speech synthesis system; HTS; TTS; Tsinghua University; corpus-based unit-selection concatenative speech synthesis approach; crystal architecture; embedded electronic device; multilingual text-to-speech framework; set-top box; smart-phone; synthetic acoustic parameter; Cantonese; HMM model; Speech synthesis;
Conference_Titel :
Global High Tech Congress on Electronics (GHTCE), 2012 IEEE
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4673-5086-0
DOI :
10.1109/GHTCE.2012.6490141