DocumentCode
1467725
Title
An 8-kb/s conjugate structure CELP (CS-CELP) speech coder
Author
Kataoka, Akitoshi ; Moriya, Takehiro ; Hayashi, Shinji
Author_Institution
NTT Human Interface Labs., Tokyo, Japan
Volume
4
Issue
6
fYear
1996
fDate
11/1/1996
Firstpage
401
Lastpage
411
Abstract
This paper describes a high-quality 8-kb/s speech coder called conjugate structure code-excited linear prediction (CS-CELP) with a 10-ms frame length. To provide a short delay and high quality under both error-free and channel error conditions, it uses three new schemes: line spectrum pair (LSP) quantization using interframe prediction, preselection in the codebook search, and gain vector quantization (VQ) with backward prediction. The LSP parameters are quantized using multistage VQ with moving-average (MA) prediction. This scheme can operate efficiently with various frequency responses of speech. Preselection in the codebook search reduces computational complexity and improves robustness to channel errors. The gain VQ with backward prediction provides high quality and robustness without transmitting input speech power information. A conjugate structure for both the random codebook and the gain codebook is introduced to improve the ability to handle random bit errors and to reduce codebook storage memory requirements. Subjective testing indicates that the quality of this coder is equivalent to that of 32-kb/s adaptive differential pulse code modulation (ADPCM) under error-free conditions. Testing has further demonstrated that the coder is robust against random bit errors.
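The storage claim in the abstract can be illustrated with a minimal sketch. It assumes the usual reading of a conjugate structure, in which each decoded vector is the sum of one entry from each of two small sub-codebooks; the abstract itself does not spell this out. The sub-codebook size, vector dimension, random contents, and exhaustive search below are hypothetical choices for illustration, not the coder's actual parameters or search procedure. Only the 8-kb/s rate and 10-ms frame length come from the abstract.

```python
import random

# From the abstract: 8 kb/s at a 10-ms frame length gives the bits per frame.
BITS_PER_FRAME = 8000 * 10 // 1000  # 80 bits

# Hypothetical illustration parameters (not taken from the paper).
BITS_PER_SUB = 4   # index bits per sub-codebook
DIM = 8            # vector dimension


def make_codebook(size, dim, seed):
    """Build a toy codebook of Gaussian random vectors."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(size)]


def conjugate_decode(cb_a, cb_b, i, j):
    """Decoded vector = sum of one entry from each sub-codebook (assumed form)."""
    return [a + b for a, b in zip(cb_a[i], cb_b[j])]


def conjugate_search(cb_a, cb_b, target):
    """Exhaustive search for the index pair with minimum squared error."""
    best = None
    for i in range(len(cb_a)):
        for j in range(len(cb_b)):
            cand = conjugate_decode(cb_a, cb_b, i, j)
            err = sum((t - c) ** 2 for t, c in zip(target, cand))
            if best is None or err < best[0]:
                best = (err, i, j)
    return best[1], best[2]


cb_a = make_codebook(2 ** BITS_PER_SUB, DIM, seed=1)
cb_b = make_codebook(2 ** BITS_PER_SUB, DIM, seed=2)

# Stored vectors: two small sub-codebooks versus one codebook with the same
# total number of index bits.
stored_conjugate = len(cb_a) + len(cb_b)
stored_single = 2 ** (2 * BITS_PER_SUB)
```

Under these assumptions, a bit error in one of the two indices replaces only one of the two summed entries, so the other half of the decoded vector is still correct. That is one way to read the abstract's claim about handling random bit errors.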
Keywords
coding errors; linear predictive coding; moving average processes; spectral analysis; speech coding; telecommunication channels; vector quantisation; 8 kbit/s; ADPCM; CS-CELP speech coder; LSP parameters; LSP quantization; backward prediction; channel error conditions; code-excited linear prediction; codebook search preselection; codebook storage memory; computational complexity; conjugate structure CELP; delay; error-free conditions; frame length; frequency responses; gain codebook; gain vector quantization; interframe prediction; line spectrum pair; moving-average prediction; multistage VQ; random bit errors; random codebook; subjective testing; Decoding; Delay; Linear predictive coding; North America; Personal communication networks; Pulse modulation; Robustness; Speech analysis; Speech coding; Testing;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
IEEE
ISSN
1063-6676
Type
jour
DOI
10.1109/89.544525
Filename
544525