Title :
Forward masking phenomenon in concatenative speech synthesis
Author :
M. Cernak;G. Rozinaj
Author_Institution :
Fac. of Electr. Eng., Slovak Tech. Univ., Bratislava, Slovakia
fDate :
2003
Abstract :
The approach described in this paper brings additional knowledge into the design of concatenative text-to-speech systems. The knowledge is based on the masking phenomenon of the inner ear, particularly its temporal (forward) masking properties. Such a knowledge-based design is suggested for use in unit selection-based speech synthesis, currently a prominent technique in concatenative synthesis, which relies on a large speech corpus. The more prosodic variability the corpus captures, the more natural the synthetic voice sounds, and the more opportunities there are for forward masking events to occur during concatenation of the selected candidate units from the corpus.
Keywords :
"Speech synthesis","Humans","Speech analysis","Speech coding","Frequency","Cost function","Knowledge based systems","Speech processing","Cepstral analysis","Ear"
Conference_Titel :
Video/Image Processing and Multimedia Communications, 2003. 4th EURASIP Conference focused on
Print_ISBN :
953-184-054-7
DOI :
10.1109/VIPMC.2003.1220544