DocumentCode
396850
Title
Scalable speech coding spanning the 4 Kbps divide
Author
Lukasiak, J. ; Burnett, I.S.
Author_Institution
Whisper Lab., Wollongong Univ., NSW, Australia
Volume
1
fYear
2003
fDate
1-4 July 2003
Firstpage
397
Abstract
This paper examines a scalable method for coding the LP residual. The scalable method is capable of increasing the accuracy of the reconstructed speech from a parametric representation at low rates to a more accurate waveform matched representation at higher rates. The method entails pitch length segmentation, decomposition into pulsed and noise components and modeling of the pulsed components using a fixed shape pulse model in a closed-loop, Analysis by Synthesis system. Subjective testing is presented that indicates that in addition to the AbyS modeling, the pulse parameter evolution must be constrained in synthesis. Results indicate that this proposed method is capable of producing perceptually scalable speech quality as the bit rate is increased through 4 kbps.
Keywords
hearing; linear predictive coding; signal reconstruction; signal representation; singular value decomposition; speech coding; speech synthesis; 4 Kbit/s; analysis-synthesis system; bit rate; closed-loop; fixed shape pulse model; linear predictive coding; noise component; pitch length segmentation; pulse decomposition; pulsed component; scalable speech coding; speech perception; speech quality; speech reconstruction; speech representation; waveform matching; Australia; Bit rate; Compression algorithms; Laboratories; Pulse shaping methods; Scalability; Speech analysis; Speech coding; Speech synthesis; Switches;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Its Applications, 2003. Proceedings. Seventh International Symposium on
Print_ISBN
0-7803-7946-2
Type
conf
DOI
10.1109/ISSPA.2003.1224724
Filename
1224724
Link To Document