Year Prediction MSD

Donated on 2/6/2011

Prediction of the release year of a song from audio features. Songs are mostly western, commercial tracks ranging from 1922 to 2011, with a peak in the year 2000s.

Dataset Characteristics

Multivariate

Subject Area

Other

Associated Tasks

Regression

Feature Type

Real

# Instances

515345

# Features

-

Dataset Information

Additional Information

You should respect the following train / test split: train: first 463,715 examples test: last 51,630 examples It avoids the 'producer effect' by making sure no song from a given artist ends up in both the train and test set.

Has Missing Values?

No

Variables Table

Variable NameRoleTypeDescriptionUnitsMissing Values
no
no
no
no
no
no
no
no
no
no

0 to 10 of 90

Additional Variable Information

90 attributes, 12 = timbre average, 78 = timbre covariance The first value is the year (target), ranging from 1922 to 2011. Features extracted from the 'timbre' features from The Echo Nest API. We take the average and covariance over all 'segments', each segment being described by a 12-dimensional timbre vector.

Dataset Files

FileSize
YearPredictionMSD.txt427.8 MB

Papers Citing this Dataset

Towards closing the gap between the theory and practice of SVRG

By Othmane Sebbouh, Nidham Gazagnadou, Samy Jelassi, Francis Bach, Robert Gower. 2019

Published in ArXiv.

ensmallen: a flexible C++ library for efficient function optimization

By Shikhar Bhardwaj, Ryan Curtin, Marcus Edel, Yannis Mentekidis, Conrad Sanderson. 2018

Published in ArXiv.

Controversy Rules - Discovering Regions Where Classifiers (Dis-)Agree Exceptionally

By Oren Zeev-Ben-Mordehai, Wouter Duivesteijn, Mykola Pechenizkiy. 2018

Published in ArXiv.

Anytime Stochastic Gradient Descent: A Time to Hear from all the Workers

By Nuwan Ferdinand, Stark Draper. 2018

Published in ArXiv.

0 to 5 of 7

Reviews

There are no reviews for this dataset yet.

Login to Write a Review
Download (201.2 MB)
7 citations
15349 views

Keywords

Creators

T. Bertin-Mahieux

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy