NSF Research Award Abstracts 1990-2003

Donated on 11/17/2003

This data set consists of (a) 129,000 abstracts describing NSF awards for basic research, (b) bag-of-word data files extracted from the abstracts, (c) a list of words used for indexing the bag-of-word

Dataset Characteristics

Text

Subject Area

Other

Associated Tasks

-

Feature Type

-

# Instances

129000

# Features

-

Dataset Information

Additional Information

The abstracts, one per file, were furnished by the NSF (National Science Foundation). A sample abstract is shown in the next section. The bag-of-word data was produced by automatically processing the abstracts with a text analyzer called NSFAbst, built using VisualText. While most fields of the output are very accurate, the authors were not extracted from the Investigator: field with 100% accuracy, due to wide variability in that field. The word list came from a separate process, and may not include all the words of interest in the abstracts.

Has Missing Values?

No

Reviews

There are no reviews for this dataset yet.

Login to Write a Review
Download
0 citations
1333 views

Creators

Michael Pazzani

Amnon Meyers

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy