LED Display Domain

Donated on 11/9/1988

From the book Classification and Regression Trees. Two C programs are provided here for generating sample databases.

Dataset Characteristics

Multivariate, Data-Generator

Subject Area

Computer Science

Associated Tasks

Classification

Feature Type

Categorical

# Instances

# Features

Dataset Information

Additional Information

This simple domain contains 7 Boolean attributes and 10 concepts (the decimal digits). An LED display contains 7 light-emitting diodes, hence the 7 attributes. The problem would be easy were it not for the introduction of noise: each attribute value has a 10% probability of being inverted. It is useful to know the optimal Bayes rate for these databases; here the misclassification rate is 26% (74% classification accuracy).
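The Bayes rate mentioned above can be checked by brute force, since there are only 2^7 = 128 possible attribute vectors. The sketch below is an illustration, not the method used in the book. It assumes equally likely digits, independent inversions, and a segment order of top, upper-left, upper-right, middle, lower-left, lower-right, bottom; the SEGMENTS table and the function name bayes_error are hypothetical and should be checked against led-creator.names before relying on the result.

```python
from itertools import product

# ASSUMPTION: segment order is top, upper-left, upper-right, middle,
# lower-left, lower-right, bottom. Verify against led-creator.names.
SEGMENTS = {
    0: (1, 1, 1, 0, 1, 1, 1),
    1: (0, 0, 1, 0, 0, 1, 0),
    2: (1, 0, 1, 1, 1, 0, 1),
    3: (1, 0, 1, 1, 0, 1, 1),
    4: (0, 1, 1, 1, 0, 1, 0),
    5: (1, 1, 0, 1, 0, 1, 1),
    6: (1, 1, 0, 1, 1, 1, 1),
    7: (1, 0, 1, 0, 0, 1, 0),
    8: (1, 1, 1, 1, 1, 1, 1),
    9: (1, 1, 1, 1, 0, 1, 1),
}


def bayes_error(noise=0.10):
    """Exact misclassification rate of the Bayes-optimal classifier.

    Assumes a uniform prior over the 10 digits and that each of the
    7 attributes is inverted independently with probability `noise`.
    """
    total = 0.0
    for x in product((0, 1), repeat=7):
        best = 0.0
        for pattern in SEGMENTS.values():
            # Number of attributes that would have to be inverted
            # to turn this digit's clean pattern into x.
            d = sum(a != b for a, b in zip(x, pattern))
            likelihood = noise ** d * (1.0 - noise) ** (7 - d)
            best = max(best, likelihood)
        # The optimal classifier picks the most likely digit for x.
        total += best
    # Each digit has prior 1/10, so accuracy is total / 10.
    return 1.0 - total / 10.0


if __name__ == "__main__":
    print(f"Bayes error at 10% noise: {bayes_error(0.10):.4f}")
```

If the assumed encoding matches the one used by the generators, the printed value can be compared with the 26% figure quoted above.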

Has Missing Values?

No

Variables Table

Variable Name	Role	Type	Missing Values
Attribute1	Feature	Binary	no
Attribute2	Feature	Binary	no
Attribute3	Feature	Binary	no
Attribute4	Feature	Binary	no
Attribute5	Feature	Binary	no
Attribute6	Feature	Binary	no
Attribute7	Feature	Binary	no

Additional Variable Information

-- All attribute values are either 0 or 1, according to whether the corresponding light is on or off for the decimal digit.
-- Each attribute (excluding the class attribute, which is an integer between 0 and 9 inclusive) has a 10% chance of being inverted.
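The two rules above can be sketched as a small sample generator. This is a minimal Python illustration, not a port of led-creator.c. The segment order (top, upper-left, upper-right, middle, lower-left, lower-right, bottom), the SEGMENTS table, and the function name noisy_led_sample are assumptions made for the example.

```python
import random

# ASSUMPTION: segment order is top, upper-left, upper-right, middle,
# lower-left, lower-right, bottom. Verify against led-creator.names.
SEGMENTS = {
    0: (1, 1, 1, 0, 1, 1, 1),
    1: (0, 0, 1, 0, 0, 1, 0),
    2: (1, 0, 1, 1, 1, 0, 1),
    3: (1, 0, 1, 1, 0, 1, 1),
    4: (0, 1, 1, 1, 0, 1, 0),
    5: (1, 1, 0, 1, 0, 1, 1),
    6: (1, 1, 0, 1, 1, 1, 1),
    7: (1, 0, 1, 0, 0, 1, 0),
    8: (1, 1, 1, 1, 1, 1, 1),
    9: (1, 1, 1, 1, 0, 1, 1),
}


def noisy_led_sample(rng, noise=0.10, digit=None):
    """Return (attributes, digit) for one instance.

    Each of the 7 attributes is inverted independently with
    probability `noise`; the class label is never altered.
    """
    if digit is None:
        digit = rng.randrange(10)  # digits drawn uniformly (assumption)
    attributes = [bit ^ int(rng.random() < noise) for bit in SEGMENTS[digit]]
    return attributes, digit


if __name__ == "__main__":
    rng = random.Random(1988)  # fixed seed so runs are repeatable
    for _ in range(5):
        attributes, digit = noisy_led_sample(rng)
        print(",".join(map(str, attributes)) + "," + str(digit))
```

Passing an explicit random.Random instance keeps the sketch reproducible and makes the noise level easy to vary.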

Dataset Files

File	Size
led-creator-+17.c	4.9 KB
led-creator.c	4.8 KB
led-creator.names	2.3 KB
led-creator-+17.names	1.7 KB
Index	213 Bytes

Creators

L. Breiman

J.H. Friedman

R.A. Olshen

C.J. Stone

DOI

10.24432/C5FG61

License

This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.

This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.