• DocumentCode
    1701908
  • Title
    Variable rate speech coding using STRAIGHT and temporal decomposition
  • Author
    Nguyen, Phu Chien ; Akagi, Masato
  • Author_Institution
    Japan Adv. Inst. of Sci. & Technol., Ishikawa, Japan
  • fYear
    2002
  • Firstpage
    26
  • Lastpage
    28
  • Abstract
    This paper presents a method for variable-rate speech coding at an average rate of about 1.8 kbps, based on STRAIGHT, a high-quality speech analysis-synthesis method. Spectral information is encoded by vector quantization based on limited error based event localizing temporal decomposition (LEBEL-TD), a low-delay temporal decomposition method for line spectral frequency parameters. The F0 and noise ratio parameters are first described using the LEBEL-TD technique and then scalar quantized, while the gain parameters are coded using spline interpolation. Subjective test results indicate that the performance of the proposed method is comparable to that of the 4.8 kbps US Federal Standard (FS-1016) CELP coder.
  • Keywords
    interpolation; linear predictive coding; spectral analysis; speech coding; speech synthesis; splines (mathematics); vector quantisation; 1.8 kbit/s; F0 parameters; LEBEL-TD; STRAIGHT; gain parameters; limited error based event localizing temporal decomposition; line spectral frequency parameters; noise ratio parameters; pitch-adaptive spectral analysis; reconstruction method; spectral information; speech analysis-synthesis method; spline interpolation; subjective test results; temporal decomposition; variable rate speech coding; vector quantization; Delay; Encoding; Frequency; Interpolation; Signal to noise ratio; Speech analysis; Speech coding; Spline; Testing; Vector quantization;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Speech Coding, 2002, IEEE Workshop Proceedings.
  • Print_ISBN
    0-7803-7549-1
  • Type
    conf
  • DOI
    10.1109/SCW.2002.1215712
  • Filename
    1215712