Collection and Analysis of a Parkinson Speech Dataset With Multiple Types of Sound Recordings

Sakar, Betul; Isenkul, Muhammed; Sakar, C.; Sertbas, Ahmet; Gurgen, Fikret; Delil, Sakir; Apaydin, Hülya; Kursun, Olcay

doi:10.1109/jbhi.2013.2245674

Collection and Analysis of a Parkinson Speech Dataset With Multiple Types of Sound Recordings

Atıf İçin Kopyala

Sakar B. E., Isenkul M. E., Sakar C. O., Sertbas A., Gurgen F., Delil S., ...Daha Fazla

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, cilt.17, sa.4, ss.828-834, 2013 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 17 Sayı: 4
Basım Tarihi: 2013
Doi Numarası: 10.1109/jbhi.2013.2245674
Dergi Adı: IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.828-834
İstanbul Üniversitesi Adresli: Evet

There has been an increased interest in speech pattern analysis applications of Parkinsonism for building predictive telediagnosis and telemonitoring models. For this purpose, we have collected a wide variety of voice samples, including sustained vowels, words, and sentences compiled from a set of speaking exercises for people with Parkinson's disease. There are two main issues in learning from such a dataset that consists of multiple speech recordings per subject: 1) How predictive these various types, e. g., sustained vowels versus words, of voice samples are in Parkinson's disease (PD) diagnosis? 2) How well the central tendency and dispersion metrics serve as representatives of all sample recordings of a subject? In this paper, investigating our Parkinson dataset using well-known machine learning tools, as reported in the literature, sustained vowels are found to carry more PD-discriminative information. We have also found that rather than using each voice recording of each subject as an independent data sample, representing the samples of a subject with central tendency and dispersion metrics improves generalization of the predictive model.

There has been an increased interest in speech pattern analysis applications of Parkinsonism for building predictive telediagnosis and telemonitoring models. For this purpose, we have collected a wide variety of voice samples, including sustained vowels, words, and sentences compiled from a set of speaking exercises for People with Parkisons (PWP). There are two main issues in learning from such a dataset that consists of multiple speech recordings per subject: (i) how predictive these various types, e.g. sustained vowels vs. words, of voice samples are in Parkinsons Disease (PD) diagnosis? (ii) how well the central tendency and dispersion metrics serve as representatives of all sample recordings of a subject? In this paper, investigating our Parkinson dataset using well-known machine learning tools, as reported in the literature, sustained vowels are found to carry more PD-discriminative information. We have also found that rather than using each voice recording of each subject as an independent data sample, representing the samples of a subject with central tendency and dispersion metrics improves generalization of the predictive model.