Title :
Effects of Audio Compression in Automatic Detection of Voice Pathologies
Author :
Saenz-Lechon, N. ; Osma-Ruiz, Víctor ; Godino-Llorente, Juan I. ; Blanco-Velasco, Manuel ; Cruz-Roldan, F. ; Arias-Londono, J.D.
Author_Institution :
Dept. of Ing. de Circuitos y Sist., Univ. Politec. de Madrid, Madrid
Abstract :
This paper investigates the performance of an automatic system for voice pathology detection when the voice samples have been compressed in MP3 format and different binary rates (160, 96, 64, 48, 24, and 8 kb/s). The detectors employ cepstral and noise measurements, along with their derivatives, to characterize the voice signals. The classification is performed using Gaussian mixtures models and support vector machines. The results between the different proposed detectors are compared by means of detector error tradeoff (DET) and receiver operating characteristic (ROC) curves, concluding that there are no significant differences in the performance of the detector when the binary rates of the compressed data are above 64 kb/s. This has useful applications in telemedicine, reducing the storage space of voice recordings or transmitting them over narrow-band communications channels.
Keywords :
audio signal processing; data compression; medical signal processing; speech coding; support vector machines; telemedicine; DET curve; Gaussian mixtures models; MP3 format; ROC curve; audio compression effects; automatic system performance; automatic voice pathology detection system; cepstral measurements; detector error tradeoff curve; noise measurements; receiver operating characteristic curve; support vector machines; telemedicine; voice signal characterisation; Audio compression; Cepstral analysis; Detectors; Digital audio players; Narrowband; Noise measurement; Pathology; Support vector machine classification; Support vector machines; Telemedicine; Gaussian Mixture Models; Gaussian mixture models; MP3 compression; Support Vector Machines; Voice pathology detection; support vector machines; voice pathology detection; Artifacts; Artificial Intelligence; Data Compression; Fourier Analysis; Humans; Multimedia; Normal Distribution; Pattern Recognition, Automated; ROC Curve; Sound Spectrography; Speech Acoustics; Voice; Voice Disorders; Voice Quality;
Journal_Title :
Biomedical Engineering, IEEE Transactions on
DOI :
10.1109/TBME.2008.923769