NYSK

Donated on 10/10/2013

NYSK (New York v. Strauss-Kahn) is a collection of English news articles about the case relating to allegations of sexual assault against the former IMF director Dominique Strauss-Kahn (May 2011).

Dataset Characteristics

Multivariate, Sequential, Text

Subject Area

Social Science

Associated Tasks

Clustering

Feature Type

-

# Instances

10421

# Features

7

Dataset Information

Additional Information

Documents are first obtained via a Web search using AMIEI: an integrated platform for delivering enterprise intelligence, developed by AMI Software (http://www.amisw.com/en) with the following query: ``dsk'' OR ``strauss-kahn'' OR ``strauss-khan''. NYSK dataset was used to extract topic-sentiment correlation and evolution over time but may be used for other text mining tasks like topic extraction, sentiment analysis, etc.

Has Missing Values?

No

Variable Information

Documents are then filtered and presented in XML format. All XML fields are self explanatory.

Dataset Files

FileSize
nysk.xml52.3 MB

Reviews

There are no reviews for this dataset yet.

Login to Write a Review
Download (17.5 MB)
0 citations
1267 views

Creators

Aurlien Lauf

Leila Khouas

Mohamed Dermouche

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy