DocumentCode
542233
Title
Pitch and speech-rate conversion using envelope modulation modeling
Author
Yoshida, Kazuaki ; Kazama, Michiko ; Tohyama, Mikio
Author_Institution
Kogakuin University, Hachioji-shi, Tokyo, 192-0015 Japan
Volume
1
fYear
2002
fDate
13-17 May 2002
Abstract
This article describes a method of intelligible speech representation that uses narrow-band envelopes and their carriers. This method enables modification of the talker's voice pitch and speech-rate without sacrificing intelligibility. The carrier, which shows the instantaneous phase, conveys pitch information, while the temporal envelope conveys speech-rate information and preserves speech intelligibility. The carriers, however, can be replaced by sinusoidal signals without severely degrading intelligibility or voice quality. Consequently, we can modify the pitch by shifting each envelope's carrier-frequency and convert the speech-rate by stretching or shrinking the envelopes. These findings could be useful in frequency scaling of the speech spectrum to assist hearing-impaired listeners or in time scaling of the speech signal for speech signal reproduction.
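The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the filter bank, so the sketch assumes a crude stand-in (heterodyning each band to baseband and smoothing with a moving average). The names `band_envelope`, `stretch`, `convert`, `centers`, and `win` are hypothetical and chosen only for illustration.

```python
import cmath
import math


def band_envelope(x, fs, fc, win):
    """Envelope of the narrow band around fc (Hz).

    Assumption: heterodyne to baseband, then smooth with a moving
    average of `win` samples as a crude low-pass filter.
    """
    base = [s * cmath.exp(-2j * math.pi * fc * n / fs) for n, s in enumerate(x)]
    half = win // 2
    env = []
    for n in range(len(x)):
        seg = base[max(0, n - half):n + half + 1]
        env.append(2.0 * abs(sum(seg) / len(seg)))
    return env


def stretch(env, rate):
    """Resample an envelope by linear interpolation.

    rate > 1 shrinks the envelope (faster speech); rate < 1 stretches it.
    """
    n_out = max(1, int(round(len(env) / rate)))
    out = []
    for m in range(n_out):
        pos = min(m * rate, len(env) - 1)
        i = int(pos)
        j = min(i + 1, len(env) - 1)
        frac = pos - i
        out.append((1 - frac) * env[i] + frac * env[j])
    return out


def convert(x, fs, centers, pitch=1.0, rate=1.0, win=65):
    """Resynthesize x from time-scaled band envelopes on sinusoidal carriers.

    Each carrier is a pure sinusoid at pitch * fc, so `pitch` shifts the
    carrier frequencies while `rate` only changes the envelope timing.
    """
    n_out = max(1, int(round(len(x) / rate)))
    y = [0.0] * n_out
    for fc in centers:
        env = stretch(band_envelope(x, fs, fc, win), rate)
        for n in range(n_out):
            y[n] += env[n] * math.cos(2 * math.pi * pitch * fc * n / fs)
    return y
```

For example, `convert(x, 8000, [500, 1000, 1500], pitch=1.2, rate=0.8)` would raise each carrier frequency by 20% and lengthen the signal by a factor of 1.25, under the assumptions above. Pitch and rate are controlled independently because the envelope and the carrier are handled separately.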
Keywords
Filter banks; Frequency modulation; Spectrogram; Speech; Time frequency analysis;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location
Orlando, FL, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5743745
Filename
5743745