DocumentCode
1819324
Title
An HMM-based Cantonese speech synthesis system
Author
Xin Wang ; Zhiyong Wu
Author_Institution
Tsinghua-CUHK Joint Res. Center for Media Sci., Tsinghua Univ., Shenzhen, China
fYear
2012
fDate
18-20 Nov. 2012
Firstpage
141
Lastpage
142
Abstract
This paper describes a Cantonese HMM-based speech synthesis system (HTS) using the general architecture of Crystal - a multilingual text-to-speech (TTS) framework developed in Tsinghua University. The generated synthesis engine of HTS has advantage of small footprint, the size of which is less than 7M bytes, and can be easily ported to embedded electronic devices such as smart-phones, set-top boxes, etc. Furthermore, the quality of the synthetic speech can be easily characterized by modifying the synthetic acoustic parameters of the proposed system. The result shows noticeable improvement in naturalness and smoother transition than the corpus-based unit-selection concatenative speech synthesis approach.
Keywords
hidden Markov models; speech synthesis; Cantonese HMM-based speech synthesis system; HTS; TTS; Tsinghua University; corpus-based unit-selection concatenative speech synthesis approach; crystal architecture; embedded electronic device; multilingual text-to-speech framework; set-top box; smart-phone; synthetic acoustic parameter; Cantonese; HMM model; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Global High Tech Congress on Electronics (GHTCE), 2012 IEEE
Conference_Location
Shenzhen
Print_ISBN
978-1-4673-5086-0
Type
conf
DOI
10.1109/GHTCE.2012.6490141
Filename
6490141
Link To Document