• DocumentCode
    542233
  • Title

    Pitch and speech-rate conversion using envelope modulation modeling

  • Author

    Yoshida, Kazuaki ; Kazama, Michiko ; Tohyama, Mildo

  • Author_Institution
    Kogakuin University, Hachioji-shi, Tokyo, 192-0015 Japan
  • Volume
    1
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    This article describes a method of intelligible speech representation that uses narrow-band envelopes and their carriers. This method enables modification of the talker´s voice pitch and speech-rate without sacrificing intelligibility. The carrier, which shows the instantaneous phase, conveys pitch information, while the temporal envelope conveys speech-rate information and preserves speech intelligibility. The carriers, however, can be replaced by sinusoidal signals without severely degrading intelligibility or voice quality. Consequently, we can modify the pitch by shifting each envelope´s carrier-frequency and convert the speech-rate by stretching or shrinking the envelopes. These findings could be useful in frequency scaling of the speech spectrum to assist hearing-impaired listeners or in time scaling of the speech signal for speech signal reproduction.
  • Keywords
    Filter banks; Frequency modulation; Spectrogram; Speech; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5743745
  • Filename
    5743745