Title :
Application of wavelet transforms for C/V segmentation on Mandarin speech signals
Author :
Chen, S.H. ; Wang, J.F.
Author_Institution :
Dept. of Electr. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
fDate :
4/1/2001 12:00:00 AM
Abstract :
It has been demonstrated that wavelet transforms can be developed to find the C/V segmentation point of a Mandarin speech signal. The basic idea is the utilisation of a specific function, the product function, for indicating the C/V segmentation point. Based on the wavelet transforms, the product function is generated from the appropriate approximation signal and detail signal of the input speech, and its energy profile contains the evidence for detecting the C/V segmentation point. It is shown that the C/V segmentation point can be obtained directly using of the product function and its energy profile. The main advantage of the proposed scheme is the capability of forward and directly searching for the C/V segmentation point, and there is no need to set any predetermined threshold. Thus, the pitch detector and backward-processing required in the conventional C/V segmentation algorithm are completely avoided. The analysis of the proposed algorithm on various types of Mandarin speech indicates considerable improvement over the conventional method. Experiments show that the overall accuracy rate of the proposed method reaches 95.4%
Keywords :
natural languages; signal resolution; speech processing; wavelet transforms; C/V segmentation algorithm; Mandarin speech signals segmentation; accuracy rate; approximation signal; backward-processing; energy profile; input speech; multiresolution analysis; pitch detector; product function; wavelet transforms;
Journal_Title :
Vision, Image and Signal Processing, IEE Proceedings -
DOI :
10.1049/ip-vis:20010151