Synthetic Circle Data Set
Donated on 6/5/2024
This dataset comprises 10000 two-dimensional points arranged into 100 circles, each containing 100 points. It was designed to evaluate clustering algorithms, such as k-means, by providing a clear and structured clustering challenge.
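Since the dataset targets k-means-style algorithms, one common way to compare two clusterings of the same points is the k-means objective: the sum of squared Euclidean distances from each point to its nearest center. The following is a minimal sketch of that objective using only the Python standard library. The function name `kmeans_cost` and the toy points are illustrative assumptions and are not part of the dataset or of the `ucimlrepo` package.

```python
from math import dist  # Euclidean distance between two points (Python 3.8+)


def kmeans_cost(points, centers):
    """Sum of squared Euclidean distances from each point to its nearest center."""
    return sum(min(dist(p, c) ** 2 for c in centers) for p in points)


# Toy input for illustration only (NOT taken from the dataset):
# two well-separated pairs of points.
toy_points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
good_centers = [(0.0, 1.0), (10.0, 1.0)]  # one center per pair
bad_centers = [(5.0, 0.0), (5.0, 2.0)]    # centers placed between the pairs

print(kmeans_cost(toy_points, good_centers))
print(kmeans_cost(toy_points, bad_centers))
```

A lower value means the centers fit the points more tightly, so the first set of centers scores better than the second on this toy input.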
Dataset Characteristics
Multivariate
Subject Area
Other
Associated Tasks
Classification, Clustering
Feature Type
Real
# Instances
10000
# Features
2
Dataset Information
Has Missing Values?
No
Introductory Paper
By Theo Conrads, Lukas Drexler, Joshua Könen, Daniel R. Schmidt, Melanie Schmidt. 2024
Published in 22nd International Symposium on Experimental Algorithms (SEA 2024)
Variables Table
Variable Name | Role | Type | Description | Units | Missing Values |
---|---|---|---|---|---|
Variable 1 | Feature | Continuous | | | no |
Variable 2 | Feature | Continuous | | | no |
Variable 3 | Target | Integer | | | no |
Dataset Files
File | Size |
---|---|
circles.txt | 209.6 KB |
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo

# fetch dataset
synthetic_circle_data_set = fetch_ucirepo(id=1013)

# data (as pandas dataframes)
X = synthetic_circle_data_set.data.features
y = synthetic_circle_data_set.data.targets

# metadata
print(synthetic_circle_data_set.metadata)

# variable information
print(synthetic_circle_data_set.variables)
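After loading, it can be useful to confirm that the labels match the structure stated in the description (100 groups of 100 points each). The sketch below does this with the standard library only. The helper names `summarize_labels` and `matches_description` are hypothetical, and the check assumes that the target column (Variable 3) holds the index of the circle each point belongs to, which this page does not state explicitly.

```python
from collections import Counter


def summarize_labels(labels):
    """Return (number of points, number of distinct labels, set of group sizes)."""
    counts = Counter(labels)
    return sum(counts.values()), len(counts), set(counts.values())


def matches_description(labels, n_groups=100, group_size=100):
    """True if labels form exactly n_groups groups of group_size points each."""
    total, groups, sizes = summarize_labels(labels)
    return (
        total == n_groups * group_size
        and groups == n_groups
        and sizes == {group_size}
    )
```

Assuming `y` is the one-column pandas DataFrame returned above, a call such as `matches_description(y.iloc[:, 0])` would report whether the loaded labels have that shape.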
Synthetic Circle Data Set [Dataset]. (2024). UCI Machine Learning Repository. https://doi.org/10.24432/C51909.
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows the dataset to be shared and adapted for any purpose, provided that appropriate credit is given.