Title :
Wavelet-based voiced/unvoiced classification algorithm
Author :
Jafer, E. ; Mahdi, A.E.
Author_Institution :
Dept. of Electron. & Comput. Eng., Limerick Univ., Ireland
Abstract :
A new wavelet-based algorithm for classification of speech into voiced and unvoiced segments is presented. The algorithm is based on statistical analysis of the frequency distribution of the average energy in the wavelet domain, and on the short-time zero-crossing rate of the speech signal. First, the ratio of the average energy in the wavelet low-bands to that in the wavelet highest-band for each speech segment is computed using a 4-level dyadic wavelet transform, and compared to a predetermined threshold. This is followed by measuring the zero-crossing rate of the segment and comparing it to a threshold equal to the median of the zero-crossing rates. An experimentally verified criterion based on the above two comparison processes is then applied to obtain the voicing decision. The performance of the algorithm has been evaluated using a large speech database. The algorithm is shown to perform well in the cases of both clean and noise-degraded speech.
Keywords :
discrete wavelet transforms; speech processing; statistical analysis; 4-level dyadic wavelet transform; algorithm performance evaluation; average energy frequency distribution; average energy ratio; noise-degraded speech; predetermined threshold; short-time zero-crossing rate median; speech classification; speech database; speech processing; speech signal; statistical analysis; voiced/unvoiced classification algorithm; voicing decision; wavelet domain; wavelet highest-band; wavelet low-band; wavelet-based algorithm; Classification algorithms; Discrete wavelet transforms; Frequency; Multimedia databases; Multiresolution analysis; Speech analysis; Speech enhancement; Speech processing; Wavelet analysis; Wavelet transforms;
Conference_Titel :
Video/Image Processing and Multimedia Communications, 2003. 4th EURASIP Conference focused on
Print_ISBN :
953-184-054-7
DOI :
10.1109/VIPMC.2003.1220540