Title :
Speech morphing by gradually changing spectrum parameter and fundamental frequency
Author_Institution :
NTT Human Interface Labs., Kanagawa, Japan
Abstract :
The paper proposes a new application of speech modification called “speech morphing”. In image processing, morphing is a well known technique that gradually changes one person´s face to that of someone else. Speech morphing produces similar results for speech; i.e., one person´s speech is gradually changed to that of someone else. Speech morphing makes it possible to create movies or multimedia entertainment together with image morphing. The proposed algorithm pitch-synchronously modifies fundamental frequency (F0) and DFT spectrum and outputs high quality speech. To clarify the balance of F0 modification and spectrum modification, listening tests were carried out using 20 male speakers. The results yielded the relationship between the amount of modification and speaker identity. In terms of overall performance, listening tests show that the proposed algorithm successfully generates smooth, high quality voice changes
Keywords :
spectral analysis; speech processing; speech synthesis; F0 modification; algorithm; gradually changed fundamental frequency; gradually changed spectrum parameter; high quality speech output; listening tests; male speakers; movies; multimedia entertainment; pitch-synchronously modified DFT spectrum; pitch-synchronously modified fundamental frequency; smooth high quality voice changes; speaker identity; spectrum modification; speech modification; speech morphing; Concatenated codes; Face; Frequency; Humans; Image processing; Laboratories; Motion pictures; Speech analysis; Speech synthesis; Testing;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607250