Title :
Detection of voice disorders based on wavelet and prosody-related properties
Author :
Shahnaz, C. ; Fattah, S.A. ; Mahbub, U. ; Zhu, W.-P. ; Ahmad, M.O.
Author_Institution :
Dept. of Electrical and Electronic Engineering, BUET, Dhaka-1000, Bangladesh
Abstract :
This paper presents an approach to detecting voice disorders based on wavelet and prosody-related voice properties. First, several statistical measures of the normalized energy content of the Discrete Wavelet Transform (DWT) coefficients are determined over all voice frames. Then, similar statistical measures of prosody-related voice properties, such as mean pitch, jitter, and shimmer, are computed over all frames. To form the feature vector used in both the training and testing phases, the set of statistical measures of the normalized DWT energy content is combined with the set of statistical measures of the extracted prosody-related properties. The voice samples under consideration are assumed to belong to one of two categories, healthy or disordered, so the task is formulated as a two-class problem. Finally, the feature vector is fed to a Euclidean distance-based classifier to detect disordered voices. Extensive simulations show that statistical analysis based on wavelet and prosody-related properties provides effective detection of a variety of voice disorders in a mixture of healthy and disordered voices.
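The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the wavelet, the decomposition depth, the exact statistical measures, or the classifier details. The sketch assumes a Haar wavelet, three decomposition levels, mean and standard deviation as the statistics, the common "local" definitions of jitter and shimmer, and a nearest-centroid rule for the Euclidean distance classifier. It also computes the prosody values once per recording rather than per frame. All function names are hypothetical.

```python
import math
from statistics import mean, pstdev

def haar_dwt_energies(frame, levels=3):
    """Normalized energy per subband: details D1..DL, then approximation AL.
    Assumption: Haar wavelet; the abstract does not specify one."""
    approx = list(frame)
    energies = []
    for _ in range(levels):
        pairs = range(0, len(approx) - 1, 2)
        detail = [(approx[i] - approx[i + 1]) / math.sqrt(2) for i in pairs]
        approx = [(approx[i] + approx[i + 1]) / math.sqrt(2) for i in pairs]
        energies.append(sum(d * d for d in detail))
    energies.append(sum(a * a for a in approx))
    total = sum(energies) or 1.0  # avoid division by zero on silent frames
    return [e / total for e in energies]

def local_perturbation(values):
    """Mean absolute difference of consecutive values over their mean.
    Applied to pitch periods it gives local jitter; to peak amplitudes,
    local shimmer (assumed definitions)."""
    diffs = [abs(a - b) for a, b in zip(values, values[1:])]
    return mean(diffs) / mean(values)

def feature_vector(frames, periods, amplitudes, levels=3):
    """Combine DWT-energy statistics over all frames with prosody features.
    periods: pitch periods in seconds; amplitudes: per-cycle peak amplitudes."""
    per_frame = [haar_dwt_energies(f, levels) for f in frames]
    bands = list(zip(*per_frame))  # one tuple per subband, across frames
    feats = [mean(b) for b in bands] + [pstdev(b) for b in bands]
    mean_pitch = mean(1.0 / p for p in periods)  # Hz
    feats += [mean_pitch, local_perturbation(periods),
              local_perturbation(amplitudes)]
    return feats

def centroid(vectors):
    """Component-wise mean of the training feature vectors of one class."""
    return [mean(col) for col in zip(*vectors)]

def classify(x, centroids):
    """Return the label whose centroid is nearest to x in Euclidean distance."""
    return min(centroids, key=lambda label: math.dist(x, centroids[label]))
```

In use, one would build a centroid from the training vectors of each class (for example `{"healthy": centroid(h), "disordered": centroid(d)}`) and pass each test vector to `classify`. Because mean pitch in Hz is numerically much larger than the energy ratios, some feature scaling would likely be needed before computing distances; the abstract does not say how the original method handles this.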
Keywords :
Discrete wavelet transforms; Feature extraction; Jitter; Testing; Training; Vectors; Voice disorder; jitter; pitch; shimmer; statistical measures; wavelet transform;
Conference_Title :
Circuits and Systems (ISCAS), 2012 IEEE International Symposium on
Conference_Location :
Seoul, Korea (South)
Print_ISBN :
978-1-4673-0218-0
DOI :
10.1109/ISCAS.2012.6271403