• DocumentCode
    1827824
  • Title

    Digital singing voice synthesis using a new alternating reflection model

  • Author

    Lee, Matthew E. ; Smith, Mark J T

  • Author_Institution
    Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
  • Volume
    2
  • fYear
    2002
  • fDate
    2002
  • Abstract
    Many models for computer generated singing voices have been proposed in the past and have been shown to produce a wide variety of synthesized voices. While many of these models are capable of synthesizing a particular singing voice with high musical quality, they typically are challenged with respect to naturalness, range, the ability to synthesize both male and female voices, as well as the ability to capture the identity of the singer. The analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model has proven to be effective in producing high quality voices with manageable computational cost. It is based on the combination of a block overlap-add sinusoidal representation and an analysis-by-synthesis parameter estimation technique. ABS/OLA is flexible enough to allow for modifications such as time and pitch scaling; however, it can suffer from quality degradation under such conditions. This paper presents an analysis/synthesis model that incorporates new methods to improve synthesis. These improvements add to the naturalness and flexibility in controlling perceptually important musical characteristics
  • Keywords
    music; signal representation; speech intelligibility; speech synthesis; ABS/OLA sinusoidal model; alternating reflection model; analysis-by-synthesis parameter estimation; analysis-by-synthesis/overlap-add model; analysis/synthesis model; block overlap-add sinusoidal representation; computational cost; computer generated singing voices; digital singing voice synthesis; female voices; high musical quality; male voices; musical characteristics; naturalness; pitch scaling; range; synthesized voices; third formant; time scaling; vibrato synchronous concatenation; Costs; Degradation; Frequency modulation; Image processing; Quality management; Reflection; Signal generators; Signal processing; Signal synthesis; Speech analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems, 2002. ISCAS 2002. IEEE International Symposium on
  • Conference_Location
    Phoenix-Scottsdale, AZ
  • Print_ISBN
    0-7803-7448-7
  • Type

    conf

  • DOI
    10.1109/ISCAS.2002.1011490
  • Filename
    1011490