• DocumentCode
    1105607
  • Title

    Improvement of the excitation source in the narrow-band linear prediction vocoder

  • Author

    Kang, George S. ; Everett, Stephanie S.

  • Author_Institution
    Naval Research Laboratory, Washington, DC, USA
  • Volume
    33
  • Issue
    2
  • fYear
    1985
  • fDate
    4/1/1985 12:00:00 AM
  • Firstpage
    377
  • Lastpage
    386
  • Abstract
    The major weakness of the current narrow-band LPC synthesizer lies in the use of a "canned" invariant excitation signal, The use of such an excitation signal is based on three primary assumptions, namely, 1) that the amplitude spectrum of the excitation signal is flat and time invariant, 2) that the phase spectrum of the voiced excitation signal is a time-invariant function of frequency, and 3) that the probability density function of the phase spectrum of the unvoiced excitation signal is also time invariant. This paper critically examines these assumptions and presents modifications which improve the quality of the synthesized speech without requiring the transmission of additional data. Diagnostic acceptability measure (DAM) tests show an increase of up to five points in overall speech quality with the implementation of each of these improvements. These modifications can also improve the speech quality of LPC-based speech synthesizers.
  • Keywords
    Frequency; Linear predictive coding; Narrowband; Signal processing; Speech analysis; Speech enhancement; Speech processing; Speech synthesis; Synthesizers; Vocoders;
  • fLanguage
    English
  • Journal_Title
    Acoustics, Speech and Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0096-3518
  • Type

    jour

  • DOI
    10.1109/TASSP.1985.1164556
  • Filename
    1164556