Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

NYSK Data Set
Download: Data Folder, Data Set Description

Abstract: NYSK (New York v. Strauss-Kahn) is a collection of English news articles about the case relating to allegations of sexual assault against the former IMF director Dominique Strauss-Kahn (May 2011).

Data Set Characteristics:  

Multivariate, Sequential, Text

Number of Instances:

10421

Area:

Social

Attribute Characteristics:

N/A

Number of Attributes:

7

Date Donated

2013-10-11

Associated Tasks:

Clustering

Missing Values?

N/A

Number of Web Hits:

27762


Source:

- Aurélien Lauf (alu '@' amisw.com)
- Leila Khouas (lkh '@' amisw.com)
- Mohamed Dermouche (mde '@' amisw.com)


Data Set Information:

Documents are first obtained via a Web search using AMIEI: an integrated platform for delivering enterprise intelligence, developed by AMI Software ([Web Link]) with the following query: ``dsk'' OR ``strauss-kahn'' OR ``strauss-khan''.

NYSK dataset was used to extract topic-sentiment correlation and evolution over time but may be used for other text mining tasks like topic extraction, sentiment analysis, etc.


Attribute Information:

Documents are then filtered and presented in XML format. All XML fields are self explanatory.


Relevant Papers:

(1) Mohamed Dermouche, Julien Velcin, Leila Khouas, and Sabine Loudcher. A Joint Model for Topic-Sentiment Evolution over Time. In Proceedings of The IEEE 14th International Conference on Data Mining (ICDM’2014), pages 773–778, Shenzhen, China, 2014. IEEE Computer Society.

(2) Mohamed Dermouche, Leila Khouas, Julien Velcin, and Sabine Loudcher. A Joint Model for Topic-Sentiment Modeling from Text. In Proceedings of The 30th ACM/SIGAPP Symposium On Applied Computing (SAC’2015), pages 819--824, Salamanca, Spain, 2015. ACM.



Citation Request:

Please refer to the Machine Learning Repository's citation policy


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML