A study of Asian Religious and Biblical Texts Data Set
Download: Data Folder, Data Set Description
Abstract: Mainly from Project Gutenberg, we combine Upanishads, Yoga Sutras, Buddha Sutras, Tao Te Ching and Book of Wisdom, Book of Proverbs, Book of Ecclesiastes and Book of Ecclesiasticus
|
|
Data Set Characteristics: |
Multivariate, Text |
Number of Instances: |
590 |
Area: |
Social |
Attribute Characteristics: |
Integer |
Number of Attributes: |
8265 |
Date Donated |
2019-12-24 |
Associated Tasks: |
Classification, Clustering |
Missing Values? |
N/A |
Number of Web Hits: |
46426 |
Source:
Preeti Sah
College of Computing and Information Sciences
Rochester Institute of Technology
Rochester, NY 14623
Email:ks3911 '@' rit.edu
Ernest Fokoue
School of Mathematical Sciences
Rochester Institute of Technology
Rochester, NY 14623
Email: epfeqa '@' rit.edu
Data Set Information:
Most of the sacred texts in this dataset were collected from Project Gutenberg. We herein provide the raw texts along with our pre-processed Document Term Matrices (DTM). For more details, please contact the authors
Attribute Information:
The attributes are just the words from the bag of words preprocessing of the mini-corpus made up of the 8 religious books considered in this study. There are 8265 words used
Relevant Papers:
@misc{sah2019asian,
title={What do Asian Religions Have in Common? An Unsupervised Text Analytics Exploration},
author={Preeti Sah and Ernest Fokoué},
year={2019},
eprint={1912.10847},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
Citation Request:
@misc{sah2019asian,
title={What do Asian Religions Have in Common? An Unsupervised Text Analytics Exploration},
author={Preeti Sah and Ernest Fokoué},
year={2019},
eprint={1912.10847},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
|