Title: Plants

Abstract: Data has been extracted from the USDA plants database. It contains all plants (species and genera) in the database and the states of the USA and Canada where they occur.

-----------------------------------------------------------------------

Data Set Characteristics: Multivariate
Attribute Characteristics: Categorical
Associated Tasks: Clustering
Number of Instances: 22632
Number of Attributes: 70
Missing Values? Yes
Area: Life
Date Donated: 2008-12-31

-----------------------------------------------------------------------

Source:

Original source: USDA plants database: http://plants.usda.gov/index.html

Extracted and encoded by W. Hämäläinen, Department of Computer Science, University of Helsinki, Finland. whamalai '@' cs.helsinki.fi

-----------------------------------------------------------------------

Data Set Information:

The data is in transactional form. It contains the Latin names (species or genus) and state abbreviations.

-----------------------------------------------------------------------

Attribute Information:

Each row contains a Latin name (species or genus) and a list of state abbreviations.

-----------------------------------------------------------------------

Relevant Papers:

Hämäläinen, W. and Nykänen, M.: Efficient discovery of statistically significant association rules. Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), pp. 203-212. IEEE Computer Society 2008.

-----------------------------------------------------------------------

Citation Request:

Even if the data is processed, it is good to give a reference to the original source:

USDA, NRCS. 2008. The PLANTS Database (http://plants.usda.gov/, 31 December 2008). National Plant Data Center, Baton Rouge, LA 70874-4490 USA.
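
-----------------------------------------------------------------------

Example (illustrative only):

The row layout described under Attribute Information (a Latin name followed by a list of state abbreviations) could be read along the lines of the sketch below. This is a minimal sketch, not part of the original distribution. It assumes that fields are comma-separated and that the name comes first; the description above does not state the delimiter, so check it against the actual file. The function names and the sample rows are hypothetical and are not taken from the data set.

```python
from collections import Counter


def parse_row(line, sep=","):
    """Split one transactional row into (latin_name, [state abbreviations]).

    Assumption: fields are separated by `sep` and the Latin name is the
    first field. Empty fields are dropped.
    """
    fields = [f.strip() for f in line.strip().split(sep)]
    name = fields[0]
    states = [f for f in fields[1:] if f]
    return name, states


def state_counts(rows):
    """Count in how many rows each state abbreviation appears."""
    counts = Counter()
    for _, states in rows:
        # Use a set so a state repeated within one row is counted once.
        counts.update(set(states))
    return counts


# Made-up rows for illustration; these are NOT records from the data set.
sample = [
    "plantus exemplaris,ca,or",
    "plantus,ca,or,wa",
    "herba ficta,ny",
]

rows = [parse_row(line) for line in sample]
counts = state_counts(rows)
```

With the made-up rows above, `rows[0]` is `("plantus exemplaris", ["ca", "or"])` and `counts["ca"]` is 2. To read a real file, the same `parse_row` could be applied to each line of the opened file, with the encoding chosen to match the download.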