DocumentCode
1105607
Title
Improvement of the excitation source in the narrow-band linear prediction vocoder
Author
Kang, George S. ; Everett, Stephanie S.
Author_Institution
Naval Research Laboratory, Washington, DC, USA
Volume
33
Issue
2
fYear
1985
fDate
4/1/1985 12:00:00 AM
Firstpage
377
Lastpage
386
Abstract
The major weakness of the current narrow-band LPC synthesizer lies in the use of a "canned" invariant excitation signal. The use of such an excitation signal is based on three primary assumptions, namely, 1) that the amplitude spectrum of the excitation signal is flat and time invariant, 2) that the phase spectrum of the voiced excitation signal is a time-invariant function of frequency, and 3) that the probability density function of the phase spectrum of the unvoiced excitation signal is also time invariant. This paper critically examines these assumptions and presents modifications which improve the quality of the synthesized speech without requiring the transmission of additional data. Diagnostic acceptability measure (DAM) tests show an increase of up to five points in overall speech quality with the implementation of each of these improvements. These modifications can also improve the speech quality of LPC-based speech synthesizers.
Keywords
Frequency; Linear predictive coding; Narrowband; Signal processing; Speech analysis; Speech enhancement; Speech processing; Speech synthesis; Synthesizers; Vocoders;
fLanguage
English
Journal_Title
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher
IEEE
ISSN
0096-3518
Type
jour
DOI
10.1109/TASSP.1985.1164556
Filename
1164556