DocumentCode
2020276
Title
Waveform-based speech synthesis approach with a formant frequency modification
Author
Mizuno, Hideyuki ; Abe, Masanobu ; Hirokawa, Tomohisa
Author_Institution
NTT Human Interface Lab., Tokyo, Japan
Volume
2
fYear
1993
fDate
27-30 April 1993
Firstpage
195
Abstract
A novel approach to speech synthesis based on waveform segments is proposed. One novel point of this approach is its new formant frequency modification algorithm which makes it possible to change formant frequency flexibly and so reproduce the desired speech quality. The algorithm characterizes speech formants not only by formant frequencies and formant bandwidths, but also by spectral intensities of formant frequencies. The desirable formant structure, which is specified by the parameters, is obtained by iteratively modifying the formant bandwidths. Using the specified formant structure, the speech signal is synthesized by the FFT (fast Fourier transform). Evaluation by the acoustic distance measure and by listening tests confirms the good performance of the approach. As evaluated by listening tests, the proposed method was found to increase significantly the naturalness of speech and clearly to increase speech quality.<>
Keywords
fast Fourier transforms; iterative methods; speech synthesis; waveform analysis; FFT; acoustic distance measure; algorithm; fast Fourier transform; formant bandwidths; formant frequency modification; listening tests; naturalness; performance; spectral intensities; speech quality; speech synthesis; waveform segments;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location
Minneapolis, MN, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.1993.319267
Filename
319267
Link To Document