DocumentCode
394243
Title
Spectral modification for digital singing voice synthesis using asymmetric generalized Gaussians
Author
Lee, Matthew E. ; Smith, Mark J T
Author_Institution
Center for Signal & Image Processing, Georgia Institute of Technology, Atlanta, GA, USA
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
This paper examines the problem of modelling and resynthesizing voiced song, with the goal of improving subjective performance quality. A set of methods based on the sinusoidal model for speech is introduced. These methods enable precise modification of spectral characteristics and vibrato structure while maintaining the original speech quality and naturalness of the voice. Spectral characteristics are modified by modelling the formant structure with a set of asymmetric generalized Gaussians. Subjective tests show that the proposed methods are effective in providing high-quality modifications to vocal characteristics.
Keywords
Gaussian processes; spectral analysis; speech intelligibility; speech synthesis; asymmetric generalized Gaussians; digital singing voice synthesis; formant structure modelling; sinusoidal speech model; spectral characteristics modification; subjective performance quality; subjective tests; vibrato modification; vocal characteristics; voice naturalness; voiced song modelling; voiced song resynthesis; Character recognition; Frequency; Gaussian processes; Image processing; Resonance; Signal processing; Signal synthesis; Speech analysis; Speech recognition; Testing
fLanguage
English
Publisher
IEEE
Conference_Titel
2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '03), Proceedings
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198767
Filename
1198767