Nomao

Donated on 7/3/2012

Nomao collects data about places (name, phone, localization...) from many sources. Deduplication consists in detecting what data refer to the same place. Instances in the dataset compare 2 spots.

Dataset Characteristics

Univariate

Subject Area

Computer Science

Associated Tasks

Classification

Feature Type

Real

# Instances

34465

# Features

-

Dataset Information

Additional Information

The dataset has been enriched during the Nomao Challenge: http://www.nomao.com/labs/challenge organized along with the ALRA workshop (Active Learning in Real-world Applications): http://www.nomao.com/labs/alra held at the ECML-PKDD 2012 conference.

Has Missing Values?

Yes

Variables Table

Variable NameRoleTypeDescriptionUnitsMissing Values
no
no
no
no
no
no
no
no
no
no

0 to 10 of 120

Additional Variable Information

120 attributes: 89 continuous, 31 nominal (including the attributes 'label' and 'id').

Dataset Files

FileSize
Nomao/Nomao.data13.7 MB
Nomao/Nomao.names8.1 KB

Reviews

There are no reviews for this dataset yet.

Login to Write a Review
Download (1.6 MB)
0 citations
1425 views

Creators

Laurent Candillier

Vincent Lemaire

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy