DocumentCode :
1112494
Title :
Auditory nerve representation criteria for speech analysis/synthesis
Author :
Ghitza, Oded
Author_Institution :
AT&T Bell Laboratories, Murray Hill, NJ
Volume :
35
Issue :
6
fYear :
1987
fDate :
6/1/1987
Firstpage :
736
Lastpage :
740
Abstract :
Traditional speech analysis/synthesis techniques are designed to produce synthesized speech with a spectrum (or waveform) that is as close as possible to the original. It is suggested, instead, that representations of the synthetic and the original speech be matched at the auditory nerve level. This concept has been used in conjunction with the sinusoidal representation of speech analysis/synthesis suggested by McAulay and Quatieri [6]. Based on informal listening, the synthesized speech is natural, including some tonal artifact, and highly intelligible for various kinds of speech material, in both quiet and noisy environments. The inherent dominance property of the auditory nerve representation reduces the number of sinusoidal components needed for synthesis by approximately 70 percent, offering a potential for reduced data rate.
Keywords :
Control system synthesis; Frequency; Humans; Nervous system; Quantization; Speech analysis; Speech enhancement; Speech synthesis; Vocoders; Working environment noise
fLanguage :
English
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
0096-3518
Type :
jour
DOI :
10.1109/TASSP.1987.1165223
Filename :
1165223