DocumentCode :
44968
Title :
Collection and Analysis of a Parkinson Speech Dataset With Multiple Types of Sound Recordings
Author :
Sakar, B.E. ; Isenkul, M.E. ; Sakar, C. Okan ; Sertbas, A. ; Gurgen, Fikret ; Delil, S. ; Apaydin, H. ; Kursun, O.
Author_Institution :
Dept. of Comput. Program., Bahcesehir Univ., Istanbul, Turkey
Volume :
17
Issue :
4
fYear :
2013
fDate :
Jul-13
Firstpage :
828
Lastpage :
834
Abstract :
There has been an increased interest in speech pattern analysis applications of Parkinsonism for building predictive telediagnosis and telemonitoring models. For this purpose, we have collected a wide variety of voice samples, including sustained vowels, words, and sentences compiled from a set of speaking exercises for people with Parkinson´s disease. There are two main issues in learning from such a dataset that consists of multiple speech recordings per subject: 1) How predictive these various types, e.g., sustained vowels versus words, of voice samples are in Parkinson´s disease (PD) diagnosis? 2) How well the central tendency and dispersion metrics serve as representatives of all sample recordings of a subject? In this paper, investigating our Parkinson dataset using well-known machine learning tools, as reported in the literature, sustained vowels are found to carry more PD-discriminative information. We have also found that rather than using each voice recording of each subject as an independent data sample, representing the samples of a subject with central tendency and dispersion metrics improves generalization of the predictive model.
Keywords :
audio recording; diseases; learning (artificial intelligence); medical signal processing; speech processing; PD-discriminative information; Parkinson disease; Parkinson speech dataset analysis; Parkinson speech dataset collection; Parkinsonism; dispersion metrics; machine learning tools; multiple speech recordings; sentences; sound recordings; speaking exercises; speech pattern analysis applications; sustained vowels; telediagnosis; telemonitoring models; voice recording; voice samples; words; Accuracy; Dispersion; Feature extraction; Measurement; Speech; Standards; Support vector machines; Central tendency and dispersion metrics; cross validation; multiple sound types; speech impairments; telediagnosis of Parkinson’s disease;
fLanguage :
English
Journal_Title :
Biomedical and Health Informatics, IEEE Journal of
Publisher :
ieee
ISSN :
2168-2194
Type :
jour
DOI :
10.1109/JBHI.2013.2245674
Filename :
6451090
Link To Document :
بازگشت