DocumentCode :
1694618
Title :
Speech coding based on pitch synchrony and two-stage transformation
Author :
Xiao-ming Li ; Chang-chun Bao ; Kleijn, W. Bastiaan
Author_Institution :
Speech & Audio Signal Process. Lab., Beijing Univ. of Technol., Beijing, China
fYear :
2013
Firstpage :
8159
Lastpage :
8163
Abstract :
In this paper, an effective speech coder that is based on a sparse representation of speech by exploiting the strong dependencies between adjacent pitch cycles is proposed. In the proposed coder, a pitch-synchronous processing that consists of pitch warping and a two-stage transformation is used to achieve a compact representation of the voiced speech. Power spectral density preserving quantization (PSD-PQ) is adopted for quantizing the transform coefficients. The result is a coder that is efficient over a wide range of bit rates: it approaches perfect reconstruction with increasing rate, and has a parametric signal representation at low rates. Both objective PESQ results and subjective A/B listening tests show that the proposed coder outperforms the ITU-T G.722.1 codec.
Keywords :
codecs; signal representation; speech coding; ITU-T G.722.1 codec; parametric signal representation; pitch cycles; pitch synchrony; pitch warping; pitch-synchronous processing; power spectral density preserving quantization; sparse representation; speech coder; speech coding; transform coefficients; two-stage transformation; voiced speech; Bit rate; Modulation; Quantization (signal); Speech; Speech coding; Speech processing; Transforms; Speech coding; compact representation; pitch-synchronous; quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639255
Filename :
6639255
Link To Document :
بازگشت