Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact

Repository Web            Google
View ALL Data Sets

× Check out the beta version of the new UCI Machine Learning Repository we are currently testing! Contact us if you have any issues, questions, or concerns. Click here to try out the new site.

Parkinson's Disease Classification Data Set
Download: Data Folder, Data Set Description

Abstract: The data used in this study were gathered from 188 patients with PD (107 men and 81 women) with ages ranging from 33 to 87 (65.1±10.9).

Data Set Characteristics:  


Number of Instances:




Attribute Characteristics:

Integer, Real

Number of Attributes:


Date Donated


Associated Tasks:


Missing Values?


Number of Web Hits:



C. Okan Sakar a, Gorkem Serbes b, Aysegul Gunduz c,
Hunkar C. Tunc a, Hatice Nizam d, Betul Erdogdu Sakar e, Melih Tutuncu c,
Tarkan Aydin a, M. Erdem Isenkul d, Hulya Apaydin c
a Department of Computer Engineering, Bahcesehir University, Istanbul, 34353, Turkey
b Department of Biomedical Engineering, Yildiz Technical University, Istanbul, 34220, Turkey
c Department of Neurology, CerrahpaÅŸa Faculty of Medicine, Istanbul University, Istanbul 34098, Turkey
d Department of Computer Engineering, Istanbul University, Istanbul, 34320, Turkey
e Department of Software Engineering, Bahcesehir University, Istanbul, 34353, Turkey
e-mails: {okan.sakar '@'; gserbes '@';
draysegulgunduz '@'; hunkarcan.tunc '@'; haticenizam92 '@'; betul.erdogdu '@'; tutuncumelih '@'; tarkan.aydin '@'; eisenkul '@'; hulyapay '@'}

Data Set Information:

The data used in this study were gathered from 188 patients with PD (107 men and 81 women) with ages ranging from 33 to 87 (65.1±10.9) at the Department of Neurology in Cerrahpaşa Faculty of Medicine, Istanbul University. The control group consists of 64 healthy individuals (23 men and 41 women) with ages varying between 41 and 82 (61.1±8.9). During the data collection process, the microphone is set to 44.1 KHz and following the physician’s examination, the sustained phonation of the vowel /a/ was collected from each subject with three repetitions.

Attribute Information:

Various speech signal processing algorithms including Time Frequency Features, Mel Frequency Cepstral Coefficients (MFCCs), Wavelet Transform based Features, Vocal Fold Features and TWQT features have been applied to the speech recordings of Parkinson's Disease (PD) patients to extract clinically useful information for PD assessment.

Relevant Papers:

Provide references to papers that have cited this data set in the past (if any).

Citation Request:

If you use this dataset, please cite:
Sakar, C.O., Serbes, G., Gunduz, A., Tunc, H.C., Nizam, H., Sakar, B.E., Tutuncu, M., Aydin, T., Isenkul, M.E. and Apaydin, H., 2018. A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor wavelet transform. Applied Soft Computing, DOI: [Web Link]

Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML