DocumentCode :
1105607
Title :
Improvement of the excitation source in the narrow-band linear prediction vocoder
Author :
Kang, George S. ; Everett, Stephanie S.
Author_Institution :
Naval Research Laboratory, Washington, DC, USA
Volume :
33
Issue :
2
fYear :
1985
fDate :
4/1/1985
Firstpage :
377
Lastpage :
386
Abstract :
The major weakness of the current narrow-band LPC synthesizer lies in the use of a "canned" invariant excitation signal. The use of such an excitation signal is based on three primary assumptions, namely, 1) that the amplitude spectrum of the excitation signal is flat and time invariant, 2) that the phase spectrum of the voiced excitation signal is a time-invariant function of frequency, and 3) that the probability density function of the phase spectrum of the unvoiced excitation signal is also time invariant. This paper critically examines these assumptions and presents modifications which improve the quality of the synthesized speech without requiring the transmission of additional data. Diagnostic acceptability measure (DAM) tests show an increase of up to five points in overall speech quality with the implementation of each of these improvements. These modifications can also improve the speech quality of LPC-based speech synthesizers.
Keywords :
Frequency; Linear predictive coding; Narrowband; Signal processing; Speech analysis; Speech enhancement; Speech processing; Speech synthesis; Synthesizers; Vocoders;
fLanguage :
English
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
0096-3518
Type :
jour
DOI :
10.1109/TASSP.1985.1164556
Filename :
1164556