• DocumentCode
    396850
  • Title

    Scalable speech coding spanning the 4 Kbps divide

  • Author

    Lukasiak, J. ; Burnett, I.S.

  • Author_Institution
    Whisper Lab., Wollongong Univ., NSW, Australia
  • Volume
    1
  • fYear
    2003
  • fDate
    1-4 July 2003
  • Firstpage
    397
  • Abstract
    This paper examines a scalable method for coding the LP residual. The scalable method is capable of increasing the accuracy of the reconstructed speech from a parametric representation at low rates to a more accurate waveform matched representation at higher rates. The method entails pitch length segmentation, decomposition into pulsed and noise components and modeling of the pulsed components using a fixed shape pulse model in a closed-loop, Analysis by Synthesis system. Subjective testing is presented that indicates that in addition to the AbyS modeling, the pulse parameter evolution must be constrained in synthesis. Results indicate that this proposed method is capable of producing perceptually scalable speech quality as the bit rate is increased through 4 kbps.
  • Keywords
    hearing; linear predictive coding; signal reconstruction; signal representation; singular value decomposition; speech coding; speech synthesis; 4 Kbit/s; analysis-synthesis system; bit rate; closed-loop; fixed shape pulse model; linear predictive coding; noise component; pitch length segmentation; pulse decomposition; pulsed component; scalable speech coding; speech perception; speech quality; speech reconstruction; speech representation; waveform matching; Australia; Bit rate; Compression algorithms; Laboratories; Pulse shaping methods; Scalability; Speech analysis; Speech coding; Speech synthesis; Switches;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Its Applications, 2003. Proceedings. Seventh International Symposium on
  • Print_ISBN
    0-7803-7946-2
  • Type

    conf

  • DOI
    10.1109/ISSPA.2003.1224724
  • Filename
    1224724