• DocumentCode
    302309
  • Title

    Interpolating V/UV mixture functions of a harmonic model for concatenative speech synthesis

  • Author

    Lam, King-fai ; Chan, Cheung-Fat

  • Author_Institution
    Dept. of Electron. Eng., City Univ. of Hong Kong, Hong Kong
  • Volume
    1
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    393
  • Abstract
    A high quality speech synthesis method based on interpolating the voiced/unvoiced (V/UV) mixture functions of the multiband excitation model (MBE) is proposed. In the MBE model, each harmonic band of the fundamental frequency in an excitation spectrum is rigidly declared as either voiced or unvoiced and the harmonic band is pitch-dependent. In the proposed method, each harmonic band in a short time spectrum is synthesized by mixing both voiced and unvoiced energies. The ratio of the V/UV energies in a spectrum is determined by the V/UV mixture function which is subsequently parametrized by an all-zero model. Since the V/UV decision in the proposed method is not rigidly declared and the V/UV mixture function is pitch-independent, interpolating the V/UV excitation spectrum becomes possible. Smooth transition of excitation between acoustic units can be achieved by interpolating the V/UV mixture functions of adjacent frames. Simulation results show that by incorporating the V/UV mixture function for concatenative synthesis, significant improvement in synthetic speech quality can be achieved
  • Keywords
    acoustic signal processing; harmonic analysis; interpolation; parameter estimation; spectral analysis; speech processing; speech synthesis; V/UV mixture functions; acoustic units; all-zero model; concatenative speech synthesis; excitation spectrum; fundamental frequency; harmonic band; harmonic model; interpolation; multiband excitation model; short time spectrum; simulation results; speech frames; synthetic speech quality; unvoiced energies; voiced energies; voiced/unvoiced mixture functions; Electronic mail; Frequency synthesizers; Speech processing; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.541115
  • Filename
    541115