
Nomao
Donated on 7/3/2012
Nomao collects data about places (name, phone, localization...) from many sources. Deduplication consists in detecting what data refer to the same place. Instances in the dataset compare 2 spots.
Dataset Characteristics
Univariate
Subject Area
Computer Science
Associated Tasks
Classification
Feature Type
Real
# Instances
34465
# Features
-
Dataset Information
Additional Information
The dataset has been enriched during the Nomao Challenge: http://www.nomao.com/labs/challenge organized along with the ALRA workshop (Active Learning in Real-world Applications): http://www.nomao.com/labs/alra held at the ECML-PKDD 2012 conference.
Has Missing Values?
Yes
Variables Table
Variable Name | Role | Type | Demographic | Description | Units | Missing Values |
---|---|---|---|---|---|---|
no | ||||||
no | ||||||
no | ||||||
no | ||||||
no | ||||||
no | ||||||
no | ||||||
no | ||||||
no | ||||||
no |
0 to 10 of 120
Additional Variable Information
120 attributes: 89 continuous, 31 nominal (including the attributes 'label' and 'id').
Candillier,Laurent and Lemaire,Vincent. (2012). Nomao. UCI Machine Learning Repository. https://doi.org/10.24432/C53G79.
@misc{misc_nomao_227, author = {Candillier,Laurent and Lemaire,Vincent}, title = {{Nomao}}, year = {2012}, howpublished = {UCI Machine Learning Repository}, note = {{DOI}: https://doi.org/10.24432/C53G79} }
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo # fetch dataset nomao = fetch_ucirepo(id=227) # data (as pandas dataframes) X = nomao.data.features y = nomao.data.targets # metadata print(nomao.metadata) # variable information print(nomao.variables)
Creators
Laurent Candillier
Vincent Lemaire
DOI
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.