• DocumentCode
    3217096
  • Title

    A multiscale product approach for an automatic classification of voice disorders from endoscopic high-speed videos

  • Author

    Unger, Jonas ; Schuster, Martin ; Hecker, D.J. ; Schick, B. ; Lohscheller, J.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Appl. Sci. Trier, Trier, Germany
  • fYear
    2013
  • fDate
    3-7 July 2013
  • Firstpage
    7360
  • Lastpage
    7363
  • Abstract
    Direct observation of vocal fold vibration is indispensable for a clinical diagnosis of voice disorders. Among current imaging techniques, high-speed videoendoscopy constitutes a state-of-the-art method capturing several thousand frames per second of the vocal folds during phonation. Recently, a method for extracting descriptive features from phonovibrograms, a two-dimensional image containing the spatio-temporal pattern of vocal fold dynamics, was presented. The derived features are closely related to a clinically established protocol for functional assessment of pathologic voices. The discriminative power of these features for different pathologic findings and configurations has not been assessed yet. In the current study, a collective of 220 subjects is considered for two- and multi-class problems of healthy and pathologic findings. The performance of the proposed feature set is compared to conventional feature reduction routines and was found to clearly outperform these. As such, the proposed procedure shows great potential for diagnostical issues of vocal fold disorders.
  • Keywords
    biomedical optical imaging; endoscopes; feature extraction; high-speed optical techniques; image classification; medical disorders; medical image processing; spatiotemporal phenomena; speech; vibrations; video signal processing; automatic classification; clinical diagnosis; conventional feature reduction; descriptive feature extraction; endoscopic high-speed video; functional assessment; high-speed videoendoscopy; imaging technique; multiscale product approach; pathologic findings; pathologic voices; phonation; phonovibrogram; spatiotemporal pattern; two-dimensional image; vocal fold disorder; vocal fold dynamics; vocal fold vibration; Accuracy; Biological system modeling; Feature extraction; Principal component analysis; Vectors; Vibrations; Videos;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society (EMBC), 2013 35th Annual International Conference of the IEEE
  • Conference_Location
    Osaka
  • ISSN
    1557-170X
  • Type

    conf

  • DOI
    10.1109/EMBC.2013.6611258
  • Filename
    6611258