Browse Through:
Default Task
Classification (7), Regression (0), Clustering (5), Other (0)
Attribute Type
Categorical (0), Numerical (7), Mixed (0)
Data Type
Multivariate (35), Univariate (1), Sequential (2), Time-Series (14), Text (9), Domain-Theory (2), Other (1)
Area
Life Sciences (0), Physical Sciences (1), CS / Engineering (9), Social Sciences (0), Business (2), Game (0), Other (1)
# Attributes
Less than 10 (8), 10 to 100 (6), Greater than 100 (9)
# Instances
Less than 100 (1), 100 to 1000 (2), Greater than 1000 (9)
Format Type
Matrix (5), Non-Matrix (4)
9 Data Sets
Name
Data Types
Attribute Types
# Instances
# Attributes
Year
Reuter_50_50
Multivariate, Text, Domain-Theory
Classification, Clustering
Real
2500
10000
2011
TTC-3600: Benchmark dataset for Turkish text categorization
Text
Integer
3600
4814
2017
Opinion Corpus for Lebanese Arabic Reviews (OCLAR)
Classification
3916
2019
Detect Malware Types
Multivariate, Time-Series, Text
7107
280
NIPS Conference Papers 1987-2015
Clustering
11463
5812
2016
DeliciousMIL: A Data Set for Multi-Label Multi-Instance Learning with Instance Labels
12234
8519
Health News in Twitter
58000
25000
2018
Victorian Era Authorship Attribution
93600
1000
YouTube Multiview Video Games Dataset
Multivariate, Text
Integer, Real
120000
1000000
2013