Title :
Pitch and speech-rate conversion using envelope modulation modeling
Author :
Yoshida, Kazuaki ; Kazama, Michiko ; Tohyama, Mikio
Author_Institution :
Kogakuin University, Hachioji-shi, Tokyo, 192-0015 Japan
Abstract :
This article describes a method of intelligible speech representation that uses narrow-band envelopes and their carriers. This method enables modification of the talker's voice pitch and speech-rate without sacrificing intelligibility. The carrier, which shows the instantaneous phase, conveys pitch information, while the temporal envelope conveys speech-rate information and preserves speech intelligibility. The carriers, however, can be replaced by sinusoidal signals without severely degrading intelligibility or voice quality. Consequently, we can modify the pitch by shifting each envelope's carrier-frequency and convert the speech-rate by stretching or shrinking the envelopes. These findings could be useful in frequency scaling of the speech spectrum to assist hearing-impaired listeners or in time scaling of the speech signal for speech signal reproduction.
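The resynthesis step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the narrow-band envelopes have already been extracted (the filter bank and envelope extraction are not specified in this record), that each carrier is replaced by a sinusoid at an assumed band centre frequency, that the pitch change is a multiplicative scaling of every carrier frequency, and that the envelopes are stretched by plain linear interpolation. The names `stretch_envelope`, `resynthesize`, `pitch_factor`, and `rate` are hypothetical.

```python
import math


def stretch_envelope(envelope, rate):
    """Resample an envelope by linear interpolation (an assumed method).

    rate > 1 shortens the envelope (faster speech); rate < 1 lengthens it.
    """
    if rate <= 0:
        raise ValueError("rate must be positive")
    n_out = max(1, int(round(len(envelope) / rate)))
    last = len(envelope) - 1
    out = []
    for i in range(n_out):
        pos = min(i * rate, last)      # position in the original envelope
        lo = int(pos)
        hi = min(lo + 1, last)
        frac = pos - lo
        out.append((1.0 - frac) * envelope[lo] + frac * envelope[hi])
    return out


def resynthesize(envelopes, center_freqs, fs, pitch_factor=1.0, rate=1.0):
    """Sum envelope-modulated sinusoidal carriers.

    envelopes    -- one list of envelope samples per band (assumed given)
    center_freqs -- assumed carrier frequency in Hz for each band
    fs           -- sampling rate in Hz
    pitch_factor -- multiplies every carrier frequency (assumed pitch shift)
    rate         -- speech-rate factor applied to every envelope
    """
    stretched = [stretch_envelope(env, rate) for env in envelopes]
    n = min(len(env) for env in stretched)
    signal = [0.0] * n
    for env, fc in zip(stretched, center_freqs):
        f = fc * pitch_factor
        if f >= fs / 2:                # skip carriers that would alias
            continue
        w = 2.0 * math.pi * f / fs     # carrier phase increment per sample
        for t in range(n):
            signal[t] += env[t] * math.sin(w * t)
    return signal
```

Because the carrier frequency and the envelope duration are set independently here, `pitch_factor` changes pitch without changing duration, and `rate` changes duration without moving the carriers.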
Keywords :
Filter banks; Frequency modulation; Spectrogram; Speech; Time frequency analysis;
Conference_Title :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.2002.5743745