Skip to content

Latest commit

 

History

History
26 lines (16 loc) · 1.37 KB

README.md

File metadata and controls

26 lines (16 loc) · 1.37 KB

Iris Data

This sample contains the observations about iris sepal and petal length from R. A. Fisher's classic 1936 paper, "The Use of Multiple Measurements in Taxonomic Problems". The paper describes categorizing plant varieties by using observable metrics of plants. The data is often used to demonstrate machine learning categorization algorithms.

There are 150 observations in the data set, each with a class identifying a species of iris flower upon which the observations were made.

Fields

The data has these fields:

  • SepalLengthCM: length of the sepal sample in centimeters
  • SepalWidthCM: width of the sepal sample in centimeters
  • PetalLengthCM: length of the petal sample in centimeters
  • PetalWidthCM: width of the petal sample in centimeters
  • Class: one of "Iris-setosa", "Iris-versicolor", or "Iris-verginica"

Source

Fisher, R. A. ("The use of multiple measurements in taxonomic problems")[https://doi.org/10.1111/j.1469-1809.1936.tb02137.x] Annual Eugenics, 7, Part II, 179-188 (1936); also in "Contributions to Mathematical Statistics" (John Wiley, NY, 1950).

Dua, D. and Graff, C. (2019). (UCI Machine Learning Repository)[http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.

File formats

A CSV and Feather file are made available.