Center for Machine Learning and Intelligent Systems
About  Citation Policy  Donate a Data Set  Contact


Repository Web            Google
View ALL Data Sets

A study of Asian Religious and Biblical Texts Data Set
Download: Data Folder, Data Set Description

Abstract: Mainly from Project Gutenberg, we combine Upanishads, Yoga Sutras, Buddha Sutras, Tao Te Ching and Book of Wisdom, Book of Proverbs, Book of Ecclesiastes and Book of Ecclesiasticus

Data Set Characteristics:  

Multivariate, Text

Number of Instances:

590

Area:

Social

Attribute Characteristics:

Integer

Number of Attributes:

8265

Date Donated

2019-12-24

Associated Tasks:

Classification, Clustering

Missing Values?

N/A

Number of Web Hits:

15926


Source:

Preeti Sah
College of Computing and Information Sciences
Rochester Institute of Technology
Rochester, NY 14623
Email:ks3911 '@' rit.edu


Ernest Fokoue
School of Mathematical Sciences
Rochester Institute of Technology
Rochester, NY 14623
Email: epfeqa '@' rit.edu


Data Set Information:

Most of the sacred texts in this dataset were collected from Project Gutenberg. We herein provide the raw texts along with our pre-processed Document Term Matrices (DTM). For more details, please contact the authors


Attribute Information:

The attributes are just the words from the bag of words preprocessing of the mini-corpus made up of the 8 religious books considered in this study. There are 8265 words used


Relevant Papers:

@misc{sah2019asian,
title={What do Asian Religions Have in Common? An Unsupervised Text Analytics Exploration},
author={Preeti Sah and Ernest Fokoué},
year={2019},
eprint={1912.10847},
archivePrefix={arXiv},
primaryClass={cs.CL}
}



Citation Request:

@misc{sah2019asian,
title={What do Asian Religions Have in Common? An Unsupervised Text Analytics Exploration},
author={Preeti Sah and Ernest Fokoué},
year={2019},
eprint={1912.10847},
archivePrefix={arXiv},
primaryClass={cs.CL}
}


Supported By:

 In Collaboration With:

About  ||  Citation Policy  ||  Donation Policy  ||  Contact  ||  CML