Statlog (Shuttle)

The shuttle dataset contains 9 attributes, all of which are numerical. Approximately 80% of the data belongs to class 1.

Dataset Characteristics

Multivariate

Subject Area

Physics and Chemistry

Associated Tasks

Classification

Feature Type

Integer

# Instances

58000

# Features

Dataset Information

Additional Information

Approximately 80% of the data belongs to class 1, so the default accuracy (always predicting class 1) is about 80%. The aim is to obtain an accuracy of 99% to 99.9%. The examples in the original dataset were in time order, which could presumably be relevant to classification. This was not deemed relevant for StatLog purposes, however, so the order of the examples was randomised and a portion of the original dataset was removed for validation purposes.
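The default accuracy mentioned above is the score of a classifier that always predicts the most frequent class. A minimal sketch of that calculation follows; the function name `majority_baseline_accuracy` and the toy labels are illustrative assumptions, not part of the dataset or its documentation.

```python
from collections import Counter


def majority_baseline_accuracy(labels):
    """Accuracy obtained by always predicting the most frequent label."""
    if not labels:
        raise ValueError("labels must not be empty")
    counts = Counter(labels)
    # most_common(1) returns a one-element list of (label, count) pairs.
    _, majority_count = counts.most_common(1)[0]
    return majority_count / len(labels)


# Illustrative labels only, not the real data: 8 of the 10 examples are class 1.
toy_labels = [1, 1, 1, 1, 4, 1, 1, 5, 1, 1]
baseline = majority_baseline_accuracy(toy_labels)
print(f"baseline accuracy: {baseline:.2f}")
```

Any model worth reporting on this dataset should be compared against this figure rather than against 0%, since a trivial predictor already reaches it.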

Has Missing Values?

No

Variables Table

Variable Name	Role	Type	Missing Values
Rad Flow	Feature	Integer	no
Fpv Close	Feature	Integer	no
Fpv Open	Feature	Integer	no
High	Feature	Integer	no
Bypass	Feature	Integer	no
Bpv Close	Feature	Integer	no
Bpv Open	Feature	Integer	no
class	Target	Integer	no

Additional Variable Information

The shuttle dataset contains 9 attributes, all of which are numerical; the first is time. The last column is the class, which is coded as follows:

Code	Class
1	Rad Flow
2	Fpv Close
3	Fpv Open
4	High
5	Bypass
6	Bpv Close
7	Bpv Open
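The class coding above can be turned into a small lookup when reading the data. The sketch below assumes each row is a line of whitespace-separated integers with the class code last; that file layout, the names `CLASS_NAMES` and `parse_row`, and the sample row are assumptions for illustration, so check them against shuttle.doc before relying on them.

```python
# Class codes as listed in the dataset description.
CLASS_NAMES = {
    1: "Rad Flow",
    2: "Fpv Close",
    3: "Fpv Open",
    4: "High",
    5: "Bypass",
    6: "Bpv Close",
    7: "Bpv Open",
}


def parse_row(line):
    """Split one row into (attribute values, class name).

    Assumes whitespace-separated integers with the class code in the
    last column; this layout is an assumption, not taken from the files.
    """
    values = [int(token) for token in line.split()]
    if len(values) < 2:
        raise ValueError("expected at least one attribute and a class code")
    *attributes, code = values
    if code not in CLASS_NAMES:
        raise ValueError(f"unknown class code: {code}")
    return attributes, CLASS_NAMES[code]


# Invented sample row, not copied from the dataset.
sample_attributes, sample_class = parse_row("10 0 20 0 30 0 40 50 60 4")
print(sample_attributes, sample_class)
```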

Dataset Files

File	Size
shuttle.tst	375.3 KB
shuttle.trn.Z	312.4 KB
shuttle.doc	1.2 KB
Index	146 Bytes

DOI

10.24432/C5WS31

License

This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.

This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.