Iris

Donated on 6/30/1988

A small classic dataset from Fisher, 1936. One of the earliest known datasets used for evaluating classification methods.

Dataset Characteristics

Tabular

Subject Area

Biology

Associated Tasks

Classification

Feature Type

Real

# Instances

150

# Features

Dataset Information

What do the instances in this dataset represent?

Each instance is a plant

Additional Information

This is one of the earliest datasets used in the literature on classification methods and widely used in statistics and machine learning. The data set contains 3 classes of 50 instances each, where each class refers to a type of iris plant. One class is linearly separable from the other 2; the latter are not linearly separable from each other. Predicted attribute: class of iris plant. This is an exceedingly simple domain. This data differs from the data presented in Fishers article (identified by Steve Chadwick, spchadwick@espeedaz.net ). The 35th sample should be: 4.9,3.1,1.5,0.2,"Iris-setosa" where the error is in the fourth feature. The 38th sample: 4.9,3.6,1.4,0.1,"Iris-setosa" where the errors are in the second and third features.

Has Missing Values?

Introductory Paper

The Iris data set: In search of the source of virginica

By A. Unwin, K. Kleinman. 2021

Published in Significance, 2021

Variables Table

Variable Name	Role	Type	Description	Units	Missing Values
sepal length	Feature	Continuous		cm	no
sepal width	Feature	Continuous		cm	no
petal length	Feature	Continuous		cm	no
petal width	Feature	Continuous		cm	no
class	Target	Categorical	class of iris plant: Iris Setosa, Iris Versicolour, or Iris Virginica		no

Rows per page

0 to 5 of 5

Baseline Model Performance

Dataset Files

File	Size
bezdekIris.data	4.4 KB
iris.data	4.4 KB
iris.names	2.9 KB
Index	105 Bytes

Papers Citing this Dataset

A Constructive Approach for One-Shot Training of Neural Networks Using Hypercube-Based Topological Coverings

By W. Daniel, Enoch Yeung. 2019

Published in ArXiv.

Convergence and Margin of Adversarial Training on Separable Data

By Zachary Charles, Shashank Rajput, Stephen Wright, Dimitris Papailiopoulos. 2019

Published in ArXiv.

CRAD: Clustering with Robust Autocuts and Depth

By Xin Huang, Yulia Gel. 2019

Published in 2017 IEEE International Conference on Data Mining (ICDM), 925--930} (2017).

Deep Spiking Neural Network with Spike Count based Learning Rule

By Jibin Wu, Yansong Chua, Malu Zhang, Qu Yang, Guoqi Li, Haizhou Li. 2019

Published in ArXiv.

Bounded Fuzzy Possibilistic Method

By Hossein Yazdani. 2019

Published in ArXiv.

Rows per page

0 to 5 of 352

Reviews

There are no reviews for this dataset yet.

Download (3.7 KB)

352 citations

1088068 views

Keywords

ecology

Creators

R. A. Fisher

DOI

10.24432/C56C76

License

This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.

This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.

Iris

Donated on 6/30/1988

Dataset Characteristics

Subject Area

Associated Tasks

Feature Type

# Instances

# Features

Dataset Information

Introductory Paper

Variables Table

Baseline Model Performance

Dataset Files

Papers Citing this Dataset

Reviews

Write a Review

Keywords

Creators

DOI

License