NSF Research Award Abstracts 1990-2003

Donated on 11/17/2003

This data set consists of (a) 129,000 abstracts describing NSF awards for basic research, (b) bag-of-word data files extracted from the abstracts, (c) a list of words used for indexing the bag-of-word

Dataset Characteristics

Text

Subject Area

Other

Associated Tasks

Feature Type

# Instances

129000

# Features

Dataset Information

Additional Information

The abstracts, one per file, were furnished by the NSF (National Science Foundation). A sample abstract is shown in the next section. The bag-of-word data was produced by automatically processing the abstracts with a text analyzer called NSFAbst, built using VisualText. While most fields of the output are very accurate, the authors were not extracted from the Investigator: field with 100% accuracy, due to wide variability in that field. The word list came from a separate process, and may not include all the words of interest in the abstracts.

Has Missing Values?

Dataset Files

File	Size
nsfawards.data.html	5.8 KB
nsfawards.html	1.4 KB
nsfabs_part1_out.zip	57 Bytes
nsfabs_part2_out.zip	57 Bytes
nsfabs_part3_out.zip	57 Bytes

Rows per page

0 to 5 of 9

Reviews

There are no reviews for this dataset yet.

Download (4.1 KB)

0 citations

1083 views

Creators

Michael Pazzani

Amnon Meyers

DOI

10.24432/C55C9N

License

This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.

This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.