DocumentCode :
1499431
Title :
Improved phase vocoder time-scale modification of audio
Author :
Laroche, Jean ; Dolson, Mark
Author_Institution :
Joint E-mu/Creative Technol. Center, Scotts Valley, CA, USA
Volume :
7
Issue :
3
fYear :
1999
fDate :
5/1/1999 12:00:00 AM
Firstpage :
323
Lastpage :
332
Abstract :
The phase vocoder is a well established tool for time scaling and pitch shifting speech and audio signals via modification of their short-time Fourier transforms (STFTs). In contrast to time-domain time-scaling and pitch-shifting techniques, the phase vocoder is generally considered to yield high quality results, especially for large modification factors and/or polyphonic signals. However, the phase vocoder is also known for introducing a characteristic perceptual artifact, often described as “phasiness”, “reverberation”, or “loss of presence”. This paper examines the problem of phasiness in the context of time-scale modification and provides new insights into its causes. Two extensions to the standard phase vocoder algorithm are introduced, and the resulting sound quality is shown to be significantly improved. Moreover, the modified phase vocoder is shown to provide a factor-of-two decrease in computational cost
Keywords :
Fourier transforms; audio coding; reverberation; vocoders; audio signals; computational cost reduction; loss of presence; perceptual artifact; phase vocoder; phasiness; pitch shifting; reverberation; short-time Fourier transforms; sound quality; speech signals; standard phase vocoder algorithm; time scaling; time-scale modification; Computational efficiency; Fourier transforms; Frequency modulation; Frequency synchronization; Reverberation; Signal processing; Speech processing; Telephony; Time domain analysis; Vocoders;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.759041
Filename :
759041
Link To Document :
بازگشت