• DocumentCode
    2997246
  • Title

    Application of line-spectrum pairs to low-bit-rate speech encoders

  • Author

    Kang, George S. ; Fransen, Lawrence J.

  • Author_Institution
    Naval Research Laboratory, Washington, DC
  • Volume
    10
  • fYear
    1985
  • fDate
    31138
  • Firstpage
    244
  • Lastpage
    247
  • Abstract
    A low-bit-rate speech encoder must employ bit-saving measures to achieve intelligible and natural sounding synthesized speech. Some important measures are: (a) quantization of parameters based on their spectral-error sensitivities (i.e., coarser quantization for spectrally less sensitive parameters), and (b) quantization of parameters in accordance with properties of auditory perception (i.e., coarser quantization of the higher frequency components of the speech spectral envelope, and finer representation of spectral peaks than valleys). The use of Line-Spectrum Pairs (LSPs) makes it possible to employ these measures more readily than the better known reflection coefficients. As a result, the intelligibility of an LSP-based, pitch-excited vocoder operating at 800 bits/second (b/s) can be made as high as 87 for three male speakers (as measured by the Diagnostic Rhyme Test (DRT)) which is only 1.4 below that of the 2400-b/s LPC. Likewise, the intelligibility of a 4800-b/s nonpitch-excited vocoder is as high as 92.3 which compares favorably with scores from current 9600-b/s vocoders.
  • Keywords
    Encoding; Filters; Frequency; Linear predictive coding; Quantization; Reflection; Speech analysis; Speech synthesis; Testing; Transfer functions;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '85.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1985.1168526
  • Filename
    1168526