DocumentCode
1112494
Title
Auditory nerve representation criteria for speech analysis/Synthesis
Author
Ghitza, Oded
Author_Institution
AT&T Bell Laboratories, Murray Hill, NJ
Volume
35
Issue
6
fYear
1987
fDate
6/1/1987 12:00:00 AM
Firstpage
736
Lastpage
740
Abstract
Traditional speech analysis/synthesis techniques are designed to produce synthesized speech with a spectrum (or waveform) that is as close as possible to the original. It is suggested, instead, that representations of the synthetic and the original speech be matched at the auditory nerve level. This concept has been used in conjunction with the sinusoidal representation of speech analysis/synthesis suggested by McAulay and Quatieri [6]. Based on informal listening, the synthesized speech is natural, including some tonal artifact, and highly intelligible for various kinds of speech material, in both quiet and noisy environments. The inherent dominance property of the auditory nerve representation reduces the number of sinusoidal components needed for synthesis by approximately 70 percent, offering a potential for reduced data rate.
Keywords
Control system synthesis; Frequency; Humans; Nervous system; Quantization; Speech analysis; Speech enhancement; Speech synthesis; Vocoders; Working environment noise;
fLanguage
English
Journal_Title
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher
ieee
ISSN
0096-3518
Type
jour
DOI
10.1109/TASSP.1987.1165223
Filename
1165223
Link To Document