• DocumentCode
    2020276
  • Title

    Waveform-based speech synthesis approach with a formant frequency modification

  • Author

    Mizuno, Hideyuki ; Abe, Masanobu ; Hirokawa, Tomohisa

  • Author_Institution
    NTT Human Interface Lab., Tokyo, Japan
  • Volume
    2
  • fYear
    1993
  • fDate
    27-30 April 1993
  • Firstpage
    195
  • Abstract
    A novel approach to speech synthesis based on waveform segments is proposed. One novel point of this approach is its new formant frequency modification algorithm which makes it possible to change formant frequency flexibly and so reproduce the desired speech quality. The algorithm characterizes speech formants not only by formant frequencies and formant bandwidths, but also by spectral intensities of formant frequencies. The desirable formant structure, which is specified by the parameters, is obtained by iteratively modifying the formant bandwidths. Using the specified formant structure, the speech signal is synthesized by the FFT (fast Fourier transform). Evaluation by the acoustic distance measure and by listening tests confirms the good performance of the approach. As evaluated by listening tests, the proposed method was found to increase significantly the naturalness of speech and clearly to increase speech quality.<>
  • Keywords
    fast Fourier transforms; iterative methods; speech synthesis; waveform analysis; FFT; acoustic distance measure; algorithm; fast Fourier transform; formant bandwidths; formant frequency modification; listening tests; naturalness; performance; spectral intensities; speech quality; speech synthesis; waveform segments;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
  • Conference_Location
    Minneapolis, MN, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1993.319267
  • Filename
    319267