• DocumentCode
    868983
  • Title

    Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors

  • Author

    Godino-Llorente, J.I. ; Gómez-Vilda, P.

  • Author_Institution
    Dpt. of Ingenieria de Circuitos y Sistemas, Escuela Universitaria de Ingenieria Tecnica de Telecomunicacion, Valencia, Spain
  • Volume
    51
  • Issue
    2
  • fYear
    2004
  • Firstpage
    380
  • Lastpage
    384
  • Abstract
    It is well known that vocal and voice diseases do not necessarily cause perceptible changes in the acoustic voice signal. Acoustic analysis is a useful tool to diagnose voice diseases being a complementary technique to other methods based on direct observation of the vocal folds by laryngoscopy. Through the present paper two neural-network based classification approaches applied to the automatic detection of voice disorders will be studied. Structures studied are multilayer perceptron and learning vector quantization fed using short-term vectors calculated accordingly to the well-known Mel Frequency Coefficient cepstral parameterization. The paper shows that these architectures allow the detection of voice disorders-including glottic cancer-under highly reliable conditions. Within this context, the Learning Vector quantization methodology demonstrated to be more reliable than the multilayer perceptron architecture yielding 96% frame accuracy under similar working conditions.
  • Keywords
    cepstral analysis; learning (artificial intelligence); medical signal processing; multilayer perceptrons; signal classification; speech processing; vector quantisation; Mel Frequency Coefficient; acoustic voice signal; automatic detection; frame accuracy; glottic cancer; input speech signal; laryngoscopy; learning vector quantization; multilayer perceptron; neural network based detectors; neural-network based classification; short-term cepstral parameters; short-term vectors; voice diseases; voice impairments; Acoustic signal detection; Cancer detection; Cepstral analysis; Detectors; Diseases; Mel frequency cepstral coefficient; Multilayer perceptrons; Neural networks; Speech analysis; Vector quantization; Algorithms; Cluster Analysis; Databases, Factual; Diagnosis, Computer-Assisted; Fourier Analysis; Humans; Nerve Net; Pattern Recognition, Automated; Reproducibility of Results; Retrospective Studies; Sensitivity and Specificity; Speech Acoustics; Speech Production Measurement; Voice Disorders; Voice Quality;
  • fLanguage
    English
  • Journal_Title
    Biomedical Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9294
  • Type

    jour

  • DOI
    10.1109/TBME.2003.820386
  • Filename
    1262116