• DocumentCode
    394243
  • Title

    Spectral modification for digital singing voice synthesis using asymmetric generalized Gaussians

  • Author

    Lee, Matthew E. ; Smith, Mark J T

  • Author_Institution
    Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    This paper examines the problem of modelling and resynthesis of voiced song with the goal of improving the subjective performance quality. A set of methods is introduced based on the sinusoidal model for speech which enables precise modification of spectral characteristics as well as vibrato structure while maintaining the original speech quality and naturalness of the voice. Spectral characteristics are modified by modelling the formant structure with a set of asymmetric generalized Gaussians. Subjective tests were conducted which show that the proposed methods are effective in providing high quality modifications to vocal characteristics.
  • Keywords
    Gaussian processes; spectral analysis; speech intelligibility; speech synthesis; asymmetric generalized Gaussians; digital singing voice synthesis; formant structure modelling; sinusoidal speech model; spectral characteristics modification; subjective performance quality; subjective tests; vibrato modification; vocal characteristics; voice naturalness; voiced song modelling; voiced song resynthesis; Character recognition; Frequency; Gaussian processes; Image processing; Resonance; Signal processing; Signal synthesis; Speech analysis; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198767
  • Filename
    1198767