Title :
Algorithm amalgam: morphing waveform based methods, sinusoidal models and STRAIGHT
Author :
Kawahara, Hideki ; Banno, Hideki ; Irino, Toshio ; Zolfaghari, Parham
Author_Institution :
Fac. of Syst. Eng., Wakayama Univ., Japan
Abstract :
A tool to investigate an important fundamental question in speech processing is proposed aiming to promote research on voice quality and para and non linguistic aspects of speech. The proposed method effectively emulates waveform-based methods, sinusoidal models and the high quality source filter model STRAIGHT The key idea that enables blending these seemingly disjoint algorithms is a group delay based representation of signal excitation. By using a STRAIGHT-based smoothed time-frequency representation that is shared by these three types of speech processing methods, a unified source representation is used to implement the proposed system. Informal listening tests using the proposed system indicated that phase manipulation introduces different timbre, but it does not need to reproduce the exact waveform to reproduce the same timbre. This may suggest that the possibility of further information reduction exists in synthesizing close to natural quality speech.
Keywords :
delay estimation; signal representation; smoothing methods; speech processing; speech synthesis; time-frequency analysis; STRAIGHT; algorithm amalgam; group delay based representation; high quality source filter model; informal listening tests; morphing; natural quality speech synthesis; nonlinguistic aspects; paralinguistic aspects; signal excitation; sinusoidal models; smoothed time-frequency representation; speech processing; unified source representation; voice quality; waveform based methods; Cities and towns; Filters; Laboratories; Natural languages; Speech analysis; Speech processing; Speech synthesis; Systems engineering and theory; Timbre; Time frequency analysis;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1325910