DocumentCode :
542233
Title :
Pitch and speech-rate conversion using envelope modulation modeling
Author :
Yoshida, Kazuaki ; Kazama, Michiko ; Tohyama, Mildo
Author_Institution :
Kogakuin University, Hachioji-shi, Tokyo, 192-0015 Japan
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
This article describes a method of intelligible speech representation that uses narrow-band envelopes and their carriers. This method enables modification of the talker´s voice pitch and speech-rate without sacrificing intelligibility. The carrier, which shows the instantaneous phase, conveys pitch information, while the temporal envelope conveys speech-rate information and preserves speech intelligibility. The carriers, however, can be replaced by sinusoidal signals without severely degrading intelligibility or voice quality. Consequently, we can modify the pitch by shifting each envelope´s carrier-frequency and convert the speech-rate by stretching or shrinking the envelopes. These findings could be useful in frequency scaling of the speech spectrum to assist hearing-impaired listeners or in time scaling of the speech signal for speech signal reproduction.
Keywords :
Filter banks; Frequency modulation; Spectrogram; Speech; Time frequency analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743745
Filename :
5743745
Link To Document :
بازگشت