DocumentCode :
868983
Title :
Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors
Author :
Godino-Llorente, J.I. ; Gómez-Vilda, P.
Author_Institution :
Dpt. of Ingenieria de Circuitos y Sistemas, Escuela Universitaria de Ingenieria Tecnica de Telecomunicacion, Valencia, Spain
Volume :
51
Issue :
2
fYear :
2004
Firstpage :
380
Lastpage :
384
Abstract :
It is well known that vocal and voice diseases do not necessarily cause perceptible changes in the acoustic voice signal. Acoustic analysis is a useful tool to diagnose voice diseases being a complementary technique to other methods based on direct observation of the vocal folds by laryngoscopy. Through the present paper two neural-network based classification approaches applied to the automatic detection of voice disorders will be studied. Structures studied are multilayer perceptron and learning vector quantization fed using short-term vectors calculated accordingly to the well-known Mel Frequency Coefficient cepstral parameterization. The paper shows that these architectures allow the detection of voice disorders-including glottic cancer-under highly reliable conditions. Within this context, the Learning Vector quantization methodology demonstrated to be more reliable than the multilayer perceptron architecture yielding 96% frame accuracy under similar working conditions.
Keywords :
cepstral analysis; learning (artificial intelligence); medical signal processing; multilayer perceptrons; signal classification; speech processing; vector quantisation; Mel Frequency Coefficient; acoustic voice signal; automatic detection; frame accuracy; glottic cancer; input speech signal; laryngoscopy; learning vector quantization; multilayer perceptron; neural network based detectors; neural-network based classification; short-term cepstral parameters; short-term vectors; voice diseases; voice impairments; Acoustic signal detection; Cancer detection; Cepstral analysis; Detectors; Diseases; Mel frequency cepstral coefficient; Multilayer perceptrons; Neural networks; Speech analysis; Vector quantization; Algorithms; Cluster Analysis; Databases, Factual; Diagnosis, Computer-Assisted; Fourier Analysis; Humans; Nerve Net; Pattern Recognition, Automated; Reproducibility of Results; Retrospective Studies; Sensitivity and Specificity; Speech Acoustics; Speech Production Measurement; Voice Disorders; Voice Quality;
fLanguage :
English
Journal_Title :
Biomedical Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9294
Type :
jour
DOI :
10.1109/TBME.2003.820386
Filename :
1262116
Link To Document :
بازگشت