• DocumentCode
    2272008
  • Title

    Speech-to-Singing Synthesis: Converting Speaking Voices to Singing Voices by Controlling Acoustic Features Unique to Singing Voices

  • Author

    Saitou, Takeshi ; Goto, Masataka ; Unoki, Masashi ; Akagi, Masato

  • Author_Institution
    National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Umezono, Tsukuba, Ibaraki 305-8568, Japan. saitou-t@aist.go.jp
  • fYear
    2007
  • fDate
    21-24 Oct. 2007
  • Firstpage
    215
  • Lastpage
    218
  • Abstract
    This paper describes a speech-to-singing synthesis system that can synthesize a singing voice, given a speaking voice reading the lyrics of a song and its musical score. The system is based on the speech manipulation system STRAIGHT and comprises three models controlling three acoustic features unique to singing voices: the fundamental frequency (F0), phoneme duration, and spectrum. Given the musical score and its tempo, the F0 control model generates the F0 contour of the singing voice by controlling four types of F0 fluctuations: overshoot, vibrato, preparation, and fine fluctuation. The duration control model lengthens the duration of each phoneme in the speaking voice by considering the duration of its musical note. The spectral control model converts the spectral envelope of the speaking voice into that of the singing voice by controlling both the singing formant and the amplitude modulation of formants in synchronization with vibrato. Experimental results show that the proposed system can convert speaking voices into singing voices whose naturalness is almost the same as actual singing voices.
  • Keywords
    Acoustic applications; Acoustic signal processing; Automatic control; Conferences; Control system synthesis; Fluctuations; Loudspeakers; Signal synthesis; Speech analysis; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications of Signal Processing to Audio and Acoustics, 2007 IEEE Workshop on
  • Conference_Location
    New Paltz, NY, USA
  • Print_ISBN
    978-1-4244-1620-2
  • Electronic_ISBN
    978-1-4244-1619-6
  • Type

    conf

  • DOI
    10.1109/ASPAA.2007.4393001
  • Filename
    4393001