DocumentCode :
2020276
Title :
Waveform-based speech synthesis approach with a formant frequency modification
Author :
Mizuno, Hideyuki ; Abe, Masanobu ; Hirokawa, Tomohisa
Author_Institution :
NTT Human Interface Lab., Tokyo, Japan
Volume :
2
fYear :
1993
fDate :
27-30 April 1993
Firstpage :
195
Abstract :
A novel approach to speech synthesis based on waveform segments is proposed. One novel point of this approach is its new formant frequency modification algorithm which makes it possible to change formant frequency flexibly and so reproduce the desired speech quality. The algorithm characterizes speech formants not only by formant frequencies and formant bandwidths, but also by spectral intensities of formant frequencies. The desirable formant structure, which is specified by the parameters, is obtained by iteratively modifying the formant bandwidths. Using the specified formant structure, the speech signal is synthesized by the FFT (fast Fourier transform). Evaluation by the acoustic distance measure and by listening tests confirms the good performance of the approach. As evaluated by listening tests, the proposed method was found to increase significantly the naturalness of speech and clearly to increase speech quality.<>
Keywords :
fast Fourier transforms; iterative methods; speech synthesis; waveform analysis; FFT; acoustic distance measure; algorithm; fast Fourier transform; formant bandwidths; formant frequency modification; listening tests; naturalness; performance; spectral intensities; speech quality; speech synthesis; waveform segments;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.1993.319267
Filename :
319267
Link To Document :
بازگشت