Corel Image Features

Donated on 6/30/1999

This dataset contains image features extracted from a Corel image collection. Four sets of features are available based on the color histogram, color histogram layout, color moments, and co-occurence

Dataset Characteristics

Multivariate

Subject Area

Other

Associated Tasks

-

Feature Type

Real

# Instances

68040

# Features

-

Dataset Information

Additional Information

The original image collection was obtained from Corel at http://corel.digitalriver.com/. There are 68,040 photo images from various categories. Each set of features is stored in a separate file. For each file, a line corresponds to a single image. The first value in a line is is the image ID and the subsequent values are the feature vector (e.g. color histogram, etc.) of the image. The same image has the same ID in all files but the image ID is not the same as the image filename.

Has Missing Values?

No

Variables Table

Variable NameRoleTypeDescriptionUnitsMissing Values
no
no
no
no
no
no
no
no
no
no

0 to 10 of 89

Additional Variable Information

From each image four sets of features were extracted: - Color Histogram - Color Histogram Layout - Color Moments - Co-occurrence Texture Color Histogram: 32 dimensions (8 x 4 = H x S) - HSV color space is divided into 32 subspaces (32 colors : 8 ranges of H and 4 ranges of S). - the value in each dimension in a ColorHistogram of an image is the density of each color in the entire image. - Histogram intersection (overlap area between ColorHistograms of two images) can be used to measure the similarity between two images. Color Histogram Layout: 32 dimensions (4 x 2 x 4 = H x S x sub-images) - each image is divided into 4 sub-images (one horizontal split and one vertical split). - 4x2 Color Histogram for each sub-image is computed. - Histogram Intersection can be used to measure the similarity between two images. Color Moments: 9 dimensions (3 x 3) - the 9 values are: (one for each of H,S, and V in HSV color space) -- mean, -- standard deviation, and -- skewness. - Euclidean distance between Color Moments of two images can be used to represent the dis-similarity (distance) between two images. Co-occurrence Texture: 16 dimensions (4 x 4) - images are converted to 16 gray-scale images. - co-ocurrence in 4 directions is computed (horizontal, vertical, and two diagonal directions). the 16 values are: (one for each direction). -- Second Angular Moment, -- Contrast, I -- nverse Difference Moment, and -- Entropy. -Euclidean distance between ColorMoments of two images can be used to measure the dis-similarity (distance) between two images.

Dataset Files

FileSize
LayoutHistogram.asc.gz4.8 MB
ColorHistogram.asc.gz4.7 MB
CoocTexture.asc.gz4.3 MB
ColorMoments.asc.gz2.5 MB
CorelFeatures.data.html7.2 KB

0 to 5 of 13

Reviews

There are no reviews for this dataset yet.

Login to Write a Review
Download (16.3 MB)
0 citations
2033 views

Creators

Michael Ortega-Binderberger

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy