DocumentCode :
302343
Title :
Robust classification of speech based on the dyadic wavelet transform with application to CELP coding
Author :
Stegmann, Joachim ; Schröder, Gerhard ; Fischer, Kyrill A.
Author_Institution :
Deutsche Telekom AG, Darmstadt, Germany
Volume :
1
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
546
Abstract :
This paper describes a new algorithm for the classification of telephone-bandwidth speech that is designed for efficient control of bit allocation in low bit-rate speech coders. The algorithm is based on the dyadic wavelet transform (DyWT) and classifies each unit subframe into one of the three categories background noise/unvoiced, transients/voicing onsets, periodic/voiced. A set of three parameters is derived from the DyWT coefficients, each giving a decision score that the associated class is active. Taking the history into account, a finite-state model controlled by these parameters computes the classifier´s decision. The proposed algorithm is robust to various types of background noise. In comparison with a classifier based on the long-term autocorrelation function, the DyWT classifier proves to be superior. To evaluate its performance in CELP-type speech coders, a variety of excitation coding schemes with bit rates between 2200 and 4800 bit/s is investigated
Keywords :
finite state machines; linear predictive coding; pattern classification; speech coding; transform coding; vocoders; wavelet transforms; 2.2 to 4.8 kbit/s; CELP coding; associated class; background noise/unvoiced; decision score; dyadic wavelet transform; excitation coding schemes; finite-state model; low bit-rate speech coders; performance evaluation; periodic/voiced; robust algorithm; speech classification; speech coding; telephone-bandwidth speech; transients/voicing onsets; Algorithm design and analysis; Autocorrelation; Background noise; Bit rate; Classification algorithms; History; Noise robustness; Speech analysis; Speech coding; Wavelet transforms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.541154
Filename :
541154
Link To Document :
بازگشت