DocumentCode :
2990220
Title :
Mid-rate coding based on a sinusoidal representation of speech
Author :
McAulay, Robert J. ; Quatieri, Thomas F.
Author_Institution :
Massachusettes Institute of Technology, Lexington, Massachusettes
Volume :
10
fYear :
1985
fDate :
31138
Firstpage :
945
Lastpage :
948
Abstract :
In this paper a sinusoidal model for the speech waveform is used to develop a new analysis/synthesis technique that is characterized by the amplitudes, frequencies, and phases of the component sine waves. The resulting synthetic waveform preserves the waveform shape and is essentially perceptually indistinguishable from the original speech. Furthermore, in the presence of noise the perceptual characteristics of the speech and the noise are maintained. Based on this system, a coder operating at 8 kbps is developed that codes the amplitudes and phases of each of the sine wave components and uses a harmonic model to code all of the frequencies. Since not all of the phases can be coded, a high frequency regeneration technique is developed that exploits the properties of the sinusoidal representation of the coded baseband signal. Based on a relatively limited data base, computer simulation has demonstrated that coded speech of good quality can be achieved. A real-time simulation is being developed to provide a more thorough evaluation of the algorithm.
Keywords :
Baseband; Computational modeling; Computer simulation; Frequency synthesizers; Noise shaping; Shape; Speech analysis; Speech coding; Speech enhancement; Speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '85.
Type :
conf
DOI :
10.1109/ICASSP.1985.1168149
Filename :
1168149
Link To Document :
بازگشت