Speaker Accent Recognition
Donated on 3/3/2020
Data set featuring single English words read by speakers from six different countries for accent detection and recognition
Dataset Characteristics
Multivariate
Subject Area
Social Science
Associated Tasks
Classification
Feature Type
Real
# Instances
329
# Features
12
Dataset Information
Has Missing Values?
No
Variable Information
Response variable: language = {ES, FR, GE, IT, UK, US} The six possible accents considered Explanatory variables: X1, X2, ..., X12 Obtained using MFCC on the original time domain soundtrack of the maximum 1s of reading of a word
Dataset Files
File | Size |
---|---|
IT-F-1-1.wav | 105 KB |
GE-M-1-12.wav | 98 KB |
US-M-5-9.wav | 97 KB |
US-F-2-9.wav | 96 KB |
US-M-2-3.wav | 93 KB |
0 to 5 of 11
Reviews
There are no reviews for this dataset yet.
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo # fetch dataset speaker_accent_recognition = fetch_ucirepo(id=518) # data (as pandas dataframes) X = speaker_accent_recognition.data.features y = speaker_accent_recognition.data.targets # metadata print(speaker_accent_recognition.metadata) # variable information print(speaker_accent_recognition.variables)
Speaker Accent Recognition [Dataset]. (2020). UCI Machine Learning Repository. https://doi.org/10.24432/C52329.
DOI
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.