SIFT10M

Donated on 2/22/2016

In SIFT10M, each data point is a SIFT feature which is extracted from Caltech-256 by the open source VLFeat library. The corresponding patches of the SIFT features are provided.

Dataset Characteristics

Multivariate

Subject Area

Computer Science

Associated Tasks

Causal-Discovery

Feature Type

Integer

# Instances

11164866

# Features

-

Dataset Information

Additional Information

In SIFT10M, the titles of the png files indicate the columns position of the SIFT features. This data set has been used for evaluating the approximate nearest neighbour search methods. The patches can be used for visualisation purpose and helps for analysing the performance of the corresponding approximate nearest neighbour search methods.

Has Missing Values?

No

Variables Table

Variable NameRoleTypeDescriptionUnitsMissing Values
no
no
no
no
no
no
no
no
no
no

0 to 10 of 128

Additional Variable Information

Each SIFT feature is a 128D column, and the corresponding patch is saved in 41*41 png format. The png files are compressed into 307 tar files for downloading.

Dataset Files

FileSize
SIFT10M.tar.gz7.3 GB
README.txt1.3 KB

Reviews

There are no reviews for this dataset yet.

Login to Write a Review
Download (7.3 GB)
0 citations
1453 views

Creators

Xiping Fu

Brendan McCane

Steven Mills

Michael Albert

Lech Szymanski

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy