Document Understanding

Donated on 10/31/1994

Five concepts, expressed as predicates, to be learned

Dataset Characteristics

-

Subject Area

Other

Associated Tasks

-

Feature Type

-

# Instances

-

# Features

-

Dataset Information

Additional Information

The experiments used 30 single-page documents, all copies of letters sent by Olivetti. Each document is identified by a letter (A to Z) or a pair of letters (AA, AB, AC, AD). Six trials were performed, each randomly selecting 20 documents for the training set and 10 for the test set. The training documents for each trial are:

Trial  Training documents
1      A B C D E F G H I J K L M N O P Q R S T
2      C D E F G H I M P R S V X Y W Z AA AB AC AD
3      C D E F G H I J K P R S T U V Y W AA AB AC
4      A B C D E F G J L M N O P Q T V X Z AB AD
5      A B E F G I J K M N O P Q R T V X Z AA AD
6      A B C D E F G I J M Q S T X Y Z AA AB AC AD
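The page lists only the training documents for each trial. Since every trial uses 20 of the 30 documents for training and 10 for testing, the test set can presumably be recovered as the documents left out of training. The following is a minimal sketch under that assumption; the names `ALL_DOCUMENTS`, `TRAINING_SETS`, and `split_for_trial` are illustrative and not part of the dataset.

```python
from string import ascii_uppercase

# The 30 document identifiers: A to Z plus AA, AB, AC, AD.
ALL_DOCUMENTS = list(ascii_uppercase) + ["AA", "AB", "AC", "AD"]

# Training documents per trial, copied from the dataset description.
TRAINING_SETS = {
    1: "A B C D E F G H I J K L M N O P Q R S T",
    2: "C D E F G H I M P R S V X Y W Z AA AB AC AD",
    3: "C D E F G H I J K P R S T U V Y W AA AB AC",
    4: "A B C D E F G J L M N O P Q T V X Z AB AD",
    5: "A B E F G I J K M N O P Q R T V X Z AA AD",
    6: "A B C D E F G I J M Q S T X Y Z AA AB AC AD",
}


def split_for_trial(trial):
    """Return (training, test) document identifiers for one trial.

    Assumption: the test set is every document not used for training.
    """
    training = TRAINING_SETS[trial].split()
    training_lookup = set(training)
    test = [doc for doc in ALL_DOCUMENTS if doc not in training_lookup]
    return training, test


if __name__ == "__main__":
    for trial in sorted(TRAINING_SETS):
        training, test = split_for_trial(trial)
        print(f"Trial {trial}: {len(training)} training, "
              f"{len(test)} test -> {' '.join(test)}")
```

Under this assumption, each trial yields exactly 10 test documents, which matches the 20/10 split stated in the description.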

Has Missing Values?

No

Creators

Donato Malerba
