Diabetic Retinopathy Debrecen

Donated on 11/2/2014

This dataset contains features extracted from the Messidor image set to predict whether an image contains signs of diabetic retinopathy or not.

Dataset Characteristics

Multivariate

Subject Area

Health and Medicine

Associated Tasks

Classification

Feature Type

Integer, Real

# Instances

1151

# Features

19

Dataset Information

What do the instances in this dataset represent?

Medical patients

Additional Information

This dataset contains features extracted from the Messidor image set to predict whether an image contains signs of diabetic retinopathy or not. All features represent either a detected lesion, a descriptive feature of a anatomical part or an image-level descriptor. The underlying method image analysis and feature extraction as well as our classification technique is described in Antal and Hajdu, Knowledge-Based Systems, 2014. The image set (Messidor) is available at http://messidor.crihan.fr/index-en.php.

Has Missing Values?

No

Introductory Paper

An ensemble-based system for automatic screening of diabetic retinopathy

By B. Antal, A. Hajdu. 2014

Published in Knowledge-Based Systems

Variables Table

Variable NameRoleTypeDescriptionUnitsMissing Values
qualityFeatureBinaryThe binary result of quality assessment. 0 = bad quality 1 = sufficient quality.no
pre_screeningFeatureBinaryThe binary result of pre-screening, where 1 indicates severe retinal abnormality and 0 its lack.no
ma1FeatureIntegerma1 - ma-6 contain the results of MA detection. Each feature value stand for the number of MAs found at the confidence levels alpha = 0.5, . . . , 1, respectively.no
ma2FeatureIntegerno
ma3FeatureIntegerno
ma4FeatureIntegerno
ma5FeatureIntegerno
ma6FeatureIntegerno
exudate1FeatureContinuousexudate1 - exudate8 contain the same information as 2-7) for exudates. However, as exudates are represented by a set of points rather than the number of pixels constructing the lesions, these features are normalized by dividing the number of lesions with the diameter of the ROI to compensate different image sizes.no
exudate2FeatureContinuousno

0 to 10 of 20

Additional Variable Information

0) The binary result of quality assessment. 0 = bad quality 1 = sufficient quality. 1) The binary result of pre-screening, where 1 indicates severe retinal abnormality and 0 its lack. 2-7) The results of MA detection. Each feature value stand for the number of MAs found at the confidence levels alpha = 0.5, . . . , 1, respectively. 8-15) contain the same information as 2-7) for exudates. However, as exudates are represented by a set of points rather than the number of pixels constructing the lesions, these features are normalized by dividing the number of lesions with the diameter of the ROI to compensate different image sizes. 16) The euclidean distance of the center of the macula and the center of the optic disc to provide important information regarding the patient’s condition. This feature is also normalized with the diameter of the ROI. 17) The diameter of the optic disc. 18) The binary result of the AM/FM-based classification. 19) Class label. 1 = contains signs of DR (Accumulative label for the Messidor classes 1, 2, 3), 0 = no signs of DR.

Dataset Files

FileSize
messidor_features.arff114.5 KB

Reviews

There are no reviews for this dataset yet.

Login to Write a Review
Download (46.5 KB)
1 citations
9210 views

Creators

Balint Antal

Andras Hajdu

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy