Title :
The Use of Wavelet Packet Transform and Artificial Neural Networks in Analysis and Classification of Dysphonic Voices
Author :
Crovato, César David Paredes ; Schuck, Adalberto
Author_Institution :
Univ. Fed. do Rio Grande do Sul, Alegre
Abstract :
This paper presents a dysphonic voice classification system using the wavelet packet transform and the best basis algorithm (BBA) as dimensionality reductor and 06 artificial neural networks (ANN) acting as specialist systems. Each ANN was a 03-layer multilayer perceptron with 64 input nodes, 01 output node and in the intermediary layer the number of neurons depends on the related training pathology group. The dysphonic voice database was separated in five pathology groups and one healthy control group. Each ANN was trained and associated with one of the 06 groups, and fed by the best base tree (BBT) nodes´ entropy values, using the multiple cross validation (MCV) method and the leave-one-out (LOO) variation technique and success rates obtained were 87.5%, 95.31%, 87.5%, 100%, 96.87% and 89.06% for the groups 01 to 06, respectively.
Keywords :
entropy; medical signal processing; multilayer perceptrons; patient diagnosis; signal classification; speech; speech processing; wavelet transforms; artificial neural networks; best base tree; best basis algorithm; dysphonic voices; entropy; leave-one-out variation; multilayer perceptron; multiple cross validation; neurons; voice analysis; voice classification; wavelet packet transform; Artificial neural networks; Basis algorithms; Databases; Multilayer perceptrons; Neurons; Pathology; Speech analysis; Wavelet analysis; Wavelet packets; Wavelet transforms; Acoustical analysis of voices; artificial neural network; dysphonic voice classification; wavelet packet transform; Algorithms; Artificial Intelligence; Diagnosis, Computer-Assisted; Humans; Neural Networks (Computer); Reproducibility of Results; Sensitivity and Specificity; Signal Processing, Computer-Assisted; Sound Spectrography; Voice Disorders;
Journal_Title :
Biomedical Engineering, IEEE Transactions on
DOI :
10.1109/TBME.2006.889780