Spoken Arabic Digit

Donated on 9/12/2010

This dataset contains timeseries of mel-frequency cepstrum coefficients (MFCCs) corresponding to spoken Arabic digits. Includes data from 44 male and 44 female native Arabic speakers.

Dataset Characteristics

Multivariate, Time-Series

Subject Area

Other

Associated Tasks

Classification

Feature Type

Real

# Instances

8800

# Features

13

Dataset Information

Additional Information

Dataset from 8800(10 digits x 10 repetitions x 88 speakers) time series of 13 Frequency Cepstral Coefficients (MFCCs) had taken from 44 males and 44 females Arabic native speakers between the ages 18 and 40 to represent ten spoken Arabic digit.

Has Missing Values?

No

Variable Information

Each line on the data base represents 13 MFCCs coefficients in the increasing order separated by spaces. This corresponds to one analysis frame. The 13 Mel Frequency Cepstral Coefficients (MFCCs) are computed with the following conditions; Sampling rate: 11025 Hz, 16 bits Window applied: hamming Filter pre-emphasized: 1-0.97Z^(-1)

Dataset Files

FileSize
Train_Arabic_Digit.txt27 MB
Test_Arabic_Digit.txt8.9 MB
graphic.jpg31.8 KB
documentation.html20.8 KB

Reviews

There are no reviews for this dataset yet.

Login to Write a Review
Download (14 MB)
0 citations
2549 views

Creators

Mouldi Bedda

Nacereddine Hammami

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy