DocumentCode :
2853081
Title :
Synthesis of unseen context and spectral and pitch contour smoothing in concatenated text to speech synthesis
Author :
Low, Phuay Hui ; Vaseghi, Saeed
Author_Institution :
Department of Electronic and Computer Engineering, Brunei University, London, UB8 3PH, UK
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
The availability and perceptual clarity of speech units, and how these units are put together during synthesis have always been the cornerstones of any high quality concatenative text-to-speech synthesis (TTS) system. The speech units are usually obtained from different sentences and contexts in a speaker-dependent speech database. One of the problems with speech units obtained this way is the occurrence of unseen contexts. Here, unseen contexts denote phonological sequences that are not acoustically represented in the selection pool during synthesis. Unseen units are expected in any concatenative TTS system because it is difficult to obtain an: acoustic representation of all possible existing contexts that could occur in speech. This paper proposes a pitch synchronous, overlap and merge method to synthesise the acoustic representation of unseen contexts from existing similar units found in the inventory. It also gives a brief description of spectral and pitch contour smoothing across concatenated units.
Keywords :
Artificial neural networks; Speech; Testing; Transforms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743756
Filename :
5743756
Link To Document :
بازگشت