Title :
Low bit and variable rate speech coding using local cosine transform
Author :
Enqing, Dong ; Heming, Zhao ; YongLi, Li
Author_Institution :
Dept. of Commun. & Electron. Eng., Soochow Univ., Su Zhou, China
Abstract :
An average1.6kb/s low bit and variable rate speech coder based on local cosine transform (LCT) algorithm for a two-way conversational speech is designed for the first time in the paper. The result of the voice activity detector (VAD) based on support vector machine (SVM) and the classification method of the voicing modes of the GSM half rate standard for active speech are adopted in the design of the variable bit rate coder. The moderately voiced mode and the strongly voiced mode of the voicing modes are combined as a voicing mode, the new combined voicing mode is named as a moderately and strongly voiced mode. A few segment vector quantizers of the LCT coefficients for each voicing mode and silence voicing frame (background noise) are employed, and LGB algorithm is applied to design the codebooks. A tree fast search technique is used to select the vector of the LCT coefficients for each segment. The evaluation using subject informal listening tests and a few object parameters indicates that the speech quality (intelligibility and naturalness) of the designed speech coder is better than that of the FS1015 standard coder. The new coder has higher robust than the FS1015 standard coder, which is suitable for speech coding in any environments.
Keywords :
speech coding; speech intelligibility; support vector machines; transforms; vector quantisation; local cosine transform; object parameters; segment vector quantizers; speech coding; speech quality; subject informal listening tests; support vector machine; two-way conversational speech; voice activity detector; Algorithm design and analysis; Background noise; Bit rate; Code standards; Detectors; GSM; Speech analysis; Speech coding; Support vector machine classification; Support vector machines;
Conference_Titel :
TENCON '02. Proceedings. 2002 IEEE Region 10 Conference on Computers, Communications, Control and Power Engineering
Print_ISBN :
0-7803-7490-8
DOI :
10.1109/TENCON.2002.1181304