181 early modern English plays: Transcriptions of early editions in TEI encoding
Donated on 9/27/2022
English plays (1585-1610), transcribed from early printed editions containing 181 plays. The plays as “samples” and the frequencies of the words appearing in those plays as “features” in the dataset.
Dataset Characteristics
Tabular
Subject Area
Other
Associated Tasks
Regression
Feature Type
-
# Instances
181
# Features
-
Dataset Information
For what purpose was the dataset created?
The creation of new computational methods to provide fresh insights on literary styles is a hot topic of research. There are particular challenges when the number of samples is small in comparison with the number of variables. One problem of interest to literary historians is the date of the first performance of a play of Shakespeare’s time. Currently this must usually be guessed with reference to multiple indirect external sources, or to some aspect of the content or style of the play.
Who funded the creation of the dataset?
Australian Research Council’s Discovery Projects funding scheme.
What do the instances in this dataset represent?
The dataset contains the plays as “samples”, and the frequencies of the words appearing in those plays as “features”.
Was there any data preprocessing performed?
Detail descriptions of the Pre-processing and software used for generating the data can be found in Sec. 2. "Materials and methods" of the corresponding article at: https://doi.org/10.1016/j.eswa.2022.116903.
Has Missing Values?
No
Introductory Paper
By P. Moscato, Hugh Craig, G. Egan, Mohammad Nazmul Haque, Kevin Huang, Julia Sloan, J. C. D. Oliveira. 2021
Published in Expert Syst. Appl.
Dataset Files
File | Size |
---|---|
181_plays_1585-610_t.csv | 18.2 MB |
181_plays_1585-1610-Metadata.csv | 29.1 KB |
Reviews
There are no reviews for this dataset yet.
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo # fetch dataset 181_early_modern_english_plays_transcriptions_of_early_editions_in_tei_encoding = fetch_ucirepo(id=747) # data (as pandas dataframes) X = 181_early_modern_english_plays_transcriptions_of_early_editions_in_tei_encoding.data.features y = 181_early_modern_english_plays_transcriptions_of_early_editions_in_tei_encoding.data.targets # metadata print(181_early_modern_english_plays_transcriptions_of_early_editions_in_tei_encoding.metadata) # variable information print(181_early_modern_english_plays_transcriptions_of_early_editions_in_tei_encoding.variables)
Egan, G., Antonia, A., Greatley-Hirsch, B., & Craig, H. (2021). 181 early modern English plays: Transcriptions of early editions in TEI encoding [Dataset]. UCI Machine Learning Repository. https://doi.org/10.24432/C5XW4R.
Creators
Gabriel Egan
gegan@dmu.ac.uk
De Montfort University, UK
Alexis Antonia
alexis.antonia@newcastle.edu.au
The University of Newcastle, Australia
Brett Greatley-Hirsch
B.D.GreatleyHirsch@leeds.ac.uk
University of Leeds, UK
Hugh Craig
Hugh.Craig@newcastle.edu.au
The University of Newcastle
DOI
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.