DocumentCode
868983
Title
Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors
Author
Godino-Llorente, J.I. ; Gómez-Vilda, P.
Author_Institution
Dpt. of Ingenieria de Circuitos y Sistemas, Escuela Universitaria de Ingenieria Tecnica de Telecomunicacion, Valencia, Spain
Volume
51
Issue
2
fYear
2004
Firstpage
380
Lastpage
384
Abstract
It is well known that vocal and voice diseases do not necessarily cause perceptible changes in the acoustic voice signal. Acoustic analysis is a useful tool to diagnose voice diseases being a complementary technique to other methods based on direct observation of the vocal folds by laryngoscopy. Through the present paper two neural-network based classification approaches applied to the automatic detection of voice disorders will be studied. Structures studied are multilayer perceptron and learning vector quantization fed using short-term vectors calculated accordingly to the well-known Mel Frequency Coefficient cepstral parameterization. The paper shows that these architectures allow the detection of voice disorders-including glottic cancer-under highly reliable conditions. Within this context, the Learning Vector quantization methodology demonstrated to be more reliable than the multilayer perceptron architecture yielding 96% frame accuracy under similar working conditions.
Keywords
cepstral analysis; learning (artificial intelligence); medical signal processing; multilayer perceptrons; signal classification; speech processing; vector quantisation; Mel Frequency Coefficient; acoustic voice signal; automatic detection; frame accuracy; glottic cancer; input speech signal; laryngoscopy; learning vector quantization; multilayer perceptron; neural network based detectors; neural-network based classification; short-term cepstral parameters; short-term vectors; voice diseases; voice impairments; Acoustic signal detection; Cancer detection; Cepstral analysis; Detectors; Diseases; Mel frequency cepstral coefficient; Multilayer perceptrons; Neural networks; Speech analysis; Vector quantization; Algorithms; Cluster Analysis; Databases, Factual; Diagnosis, Computer-Assisted; Fourier Analysis; Humans; Nerve Net; Pattern Recognition, Automated; Reproducibility of Results; Retrospective Studies; Sensitivity and Specificity; Speech Acoustics; Speech Production Measurement; Voice Disorders; Voice Quality;
fLanguage
English
Journal_Title
Biomedical Engineering, IEEE Transactions on
Publisher
ieee
ISSN
0018-9294
Type
jour
DOI
10.1109/TBME.2003.820386
Filename
1262116
Link To Document