Title :
Scalable speech coding spanning the 4 Kbps divide
Author :
Lukasiak, J. ; Burnett, I.S.
Author_Institution :
Whisper Lab., Wollongong Univ., NSW, Australia
Abstract :
This paper examines a scalable method for coding the linear prediction (LP) residual. The method can scale the accuracy of the reconstructed speech from a parametric representation at low rates to a more accurate waveform-matched representation at higher rates. The method entails pitch-length segmentation, decomposition into pulsed and noise components, and modeling of the pulsed components using a fixed-shape pulse model in a closed-loop analysis-by-synthesis (AbyS) system. Subjective testing indicates that, in addition to the AbyS modeling, the evolution of the pulse parameters must be constrained in synthesis. Results indicate that the proposed method produces perceptually scalable speech quality as the bit rate is increased through 4 kbps.
Keywords :
hearing; linear predictive coding; signal reconstruction; signal representation; singular value decomposition; speech coding; speech synthesis; 4 Kbit/s; analysis-synthesis system; bit rate; closed-loop; fixed shape pulse model; noise component; pitch length segmentation; pulse decomposition; pulsed component; scalable speech coding; speech perception; speech quality; speech reconstruction; speech representation; waveform matching; Australia; Bit rate; Compression algorithms; Laboratories; Pulse shaping methods; Scalability; Speech analysis; Speech coding; Speech synthesis; Switches
Conference_Title :
Signal Processing and Its Applications, 2003. Proceedings. Seventh International Symposium on
Print_ISBN :
0-7803-7946-2
DOI :
10.1109/ISSPA.2003.1224724