DocumentCode
2838926
Title
Prosody and style controls in CU VOCAL using SSML and SAPI XML tags
Author
Fung, Tien-Ying ; Li, Yuk-Chi ; Meng, Helen ; Ching, P.C.
Author_Institution
Human-Comput. Commun. Lab., Chinese Univ. of Hong Kong, Shatin, China
fYear
2004
fDate
15-18 Dec. 2004
Firstpage
209
Lastpage
212
Abstract
CU VOCAL is a Cantonese text-to-speech (TTS) engine. We use a syllable-based concatenative synthesis approach to generate intelligible and natural synthetic speech in Cantonese. The paper reports on our recent enhancements in CU VOCAL to support user adjustments in prosody and style with the use of the Speech Synthesis Markup Language (SSML) in the input text. CU VOCAL was previously developed as a SAPI-compliant engine to enable easy integration with other applications. The paper also reports on our enhancements in the CU VOCAL SAPI (speech API) engine to support the SAPI 5 XML tags.
Keywords
XML; application program interfaces; natural language interfaces; speech synthesis; Cantonese text-to-speech engine; SAPI XML tags; SSML; Speech Synthesis Markup Language; prosody; style; syllable-based concatenative synthesis; Communication system control; Digital signal processing; Engines; Laboratories; Markup languages; Research and development management; Speech synthesis; Systems engineering and theory; Technical Activities Guide -TAG; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing, 2004 International Symposium on
Print_ISBN
0-7803-8678-7
Type
conf
DOI
10.1109/CHINSL.2004.1409623
Filename
1409623
Link To Document