Drug Reviews (Drugs.com)
Donated on 10/3/2018
The dataset provides patient reviews on specific drugs along with related conditions and a 10 star patient rating reflecting overall patient satisfaction.
Dataset Characteristics
Multivariate, Text
Subject Area
Health and Medicine
Associated Tasks
Classification, Regression, Clustering
Feature Type
Integer
# Instances
215063
# Features
6
Dataset Information
Additional Information
The dataset provides patient reviews on specific drugs along with related conditions and a 10 star patient rating reflecting overall patient satisfaction. The data was obtained by crawling online pharmaceutical review sites. The intention was to study (1) sentiment analysis of drug experience over multiple facets, i.e. sentiments learned on specific aspects such as effectiveness and side effects, (2) the transferability of models among domains, i.e. conditions, and (3) the transferability of models among different data sources (see 'Drug Review Dataset (Druglib.com)'). The data is split into a train (75%) a test (25%) partition (see publication) and stored in two .tsv (tab-separated-values) files, respectively. Important notes: When using this dataset, you agree that you 1) only use the data for research purposes 2) don't use the data for any commerical purposes 3) don't distribute the data to anyone else 4) cite us
Has Missing Values?
No
Introductory Paper
By F. Gräßer, Surya Kallumadi, H. Malberg, S. Zaunseder. 2018
Published in Digital Humanities Conference
Variables Table
Variable Name | Role | Type | Description | Units | Missing Values |
---|---|---|---|---|---|
id | ID | Integer | no | ||
drugName | Feature | Categorical | no | ||
condition | Feature | Categorical | no | ||
review | Feature | Categorical | no | ||
rating | Feature | Categorical | no | ||
date | Feature | Date | no | ||
usefulCount | Feature | Categorical | no |
0 to 7 of 7
Additional Variable Information
1. drugName (categorical): name of drug 2. condition (categorical): name of condition 3. review (text): patient review 4. rating (numerical): 10 star patient rating 5. date (date): date of review entry 6. usefulCount (numerical): number of users who found review useful
Dataset Files
File | Size |
---|---|
drugsComTrain_raw.tsv | 80.4 MB |
drugsComTest_raw.tsv | 26.8 MB |
Reviews
There are no reviews for this dataset yet.
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo # fetch dataset drug_reviews_drugs_com = fetch_ucirepo(id=462) # data (as pandas dataframes) X = drug_reviews_drugs_com.data.features y = drug_reviews_drugs_com.data.targets # metadata print(drug_reviews_drugs_com.metadata) # variable information print(drug_reviews_drugs_com.variables)
Kallumadi, S. & Grer, F. (2018). Drug Reviews (Drugs.com) [Dataset]. UCI Machine Learning Repository. https://doi.org/10.24432/C5SK5S.
Creators
Surya Kallumadi
Felix Grer
DOI
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.