DocumentCode
294514
Title
4 kbps improved pitch prediction CELP speech coding with 20 ms frame
Author
Serizawa, Masahiro ; Ozawa, Kazunori
Author_Institution
C&C Inf. Technol. Res. Labs., NEC Corp., Kawasaki, Japan
Volume
1
fYear
1995
fDate
9-12 May 1995
Firstpage
1
Abstract
This paper proposes a new pitch prediction method for 4 kbps CELP (code excited LPC) speech coding. In conventional CELP speech coding, synthetic speech quality deteriorates rapidly at 4 kbps, especially for female and children's speech with short pitch periods. An important reason is that, when the pitch period is shorter than the subframe length, the adaptive codebook operation usually relies on simple repetition of the past excitation signal based on the estimated lag rather than on true pitch prediction. The proposed method can carry out true pitch prediction by utilizing the current subframe excitation codevector signal when the pitch prediction parameters are determined. For further improvement, a split weighting method and a low-complexity harmonic and spectral perceptually-weighting method have also been developed. Informal listening test results show that the 4 kbps coder with 20 msec subframe, utilizing all of the proposed improvements, achieves a MOS 0.2 higher than the coder without them.
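The conventional behaviour that the abstract criticizes, filling a subframe by repeating past excitation when the lag is shorter than the subframe, can be illustrated with a minimal sketch. This is not the proposed method, whose details the abstract does not give. The function name, argument names, and the omission of the pitch gain are assumptions made only for illustration.

```python
def adaptive_codevector_by_repetition(past_excitation, lag, subframe_len):
    """Build an adaptive-codebook vector from past excitation (illustrative only).

    If lag >= subframe_len, every output sample is copied from the past
    excitation. If lag < subframe_len, the last `lag` past samples are
    repeated periodically to fill the subframe, which is the "simple
    repetition" case described in the abstract. The pitch gain is omitted.
    """
    if lag <= 0 or lag > len(past_excitation):
        raise ValueError("lag must be in 1..len(past_excitation)")
    history = list(past_excitation)
    start = len(history)
    for n in range(subframe_len):
        # Sample n copies the sample `lag` positions earlier. Once n >= lag,
        # that earlier sample is one written by this same loop, so the
        # output is a periodic extension rather than a true prediction
        # from the current subframe's excitation.
        history.append(history[start + n - lag])
    return history[start:]
```

For example, with past excitation `[1, 2, 3, 4, 5]`, a lag of 2, and a 5-sample subframe, the function returns `[4, 5, 4, 5, 4]`: the last two past samples are simply repeated.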
Keywords
adaptive codes; harmonic analysis; linear predictive coding; spectral analysis; speech coding; speech intelligibility; speech synthesis; 20 ms; 4 kbit/s; CELP speech coding; adaptive codebook; code excited LPC; estimated lag; excitation signal; frame; informal listening test result; low-complexity harmonic method; mean opinion score; pitch prediction method; pitch prediction parameters; short pitch period; spectral perceptually-weighting method; split weighting method; subframe excitation codevector signal; subframe length; synthetic speech quality; Degradation; Delay; Information technology; Linear predictive coding; National electric code; Prediction methods; Speech coding; Standardization; Testing; Vector quantization;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479259
Filename
479259