MetroPT-3 Dataset
Donated on 3/21/2023
From a metro train in an operational context, readings from pressure, temperature, motor current, and air intake valves were collected from a compressor's Air Production Unit (APU). This dataset reveals real predictive maintenance challenges encountered in the industry. It can be used for failure predictions, anomaly explanations, and other tasks.
Dataset Characteristics
Tabular, Multivariate, Time-Series
Subject Area
Computer Science
Associated Tasks
Classification
Feature Type
Real
# Instances
1516948
# Features
15
Dataset Information
For what purpose was the dataset created?
It can be used for failure predictions, anomaly explanations, and other tasks.
Who funded the creation of the dataset?
This work was supported by the CHIST-ERA grant CHIST-ERA-19-XAI-012 and project CHIST-ERA/0004/2019, funded by FCT
What do the instances in this dataset represent?
Data Set Information: The dataset was collected to support the development of predictive maintenance, anomaly detection, and remaining useful life (RUL) prediction models for compressors using deep learning and machine learning methods. It consists of multivariate time series data obtained from several analogue and digital sensors installed on the compressor of a train. The data span between February and August 2020 and includes 15 signals, such as pressures, motor current, oil temperature, and electrical signals of air intake valves. The monitoring and logging of industrial equipment events, such as temporal behaviour and fault events, were obtained from records generated by the sensors. The data were logged at 1Hz by an onboard embedded device. You can find a schematic diagram of the air production unit of the compressor system in Figure 4 of the accompanying paper [1]. Also, the paper [2] provides a detailed examination of data collection and specifications of various types of potential failures in an air compressor system.
Are there recommended data splits?
The first month can be used for training the model, and the rest is for testing. This dataset is eligible for incremental training.
Does the dataset contain data that might be considered sensitive in any way?
no
Was there any data preprocessing performed?
Data segmentation, Normalization, Feature Extraction
Additional Information
Failure Information: The dataset is unlabeled, but the failure reports provided by the company are available in the following table. This allows for evaluating the effectiveness of anomaly detection, failure prediction, and RUL estimation algorithms. Nr. Start Time End Time Failure Severity Report #1 4/18/2020 0:00 4/18/2020 23:59 Air leak High stress #1 5/29/2020 23:30 5/30/2020 6:00 Air Leak High stress Maintenance on 30Apr at 12:00 #3 6/5/2020 10:00 6/7/2020 14:30 Air Leak High stress Maintenance on 8Jun at 16:00 #4 7/15/2020 14:30 7/15/2020 19:00 Air Leak High stress Maintenance on 16Jul at 00:00
Has Missing Values?
No
Introductory Paper
By Davari, N., Veloso, B., Ribeiro, R.P., Pereira, P.M., Gama. 2021
Published in 2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA)
Variables Table
Variable Name | Role | Type | Description | Units | Missing Values |
---|---|---|---|---|---|
index | ID | Integer | index of data | no | |
timestamp | Other | Categorical | date time | no | |
TP2 | Feature | Continuous | bar | no | |
TP3 | Feature | Continuous | bar | no | |
H1 | Feature | Continuous | bar | no | |
DV_pressure | Feature | Continuous | bar | no | |
Reservoirs | Feature | Continuous | bar | no | |
Oil_temperature | Feature | Continuous | 0C | no | |
Motor_current | Feature | Continuous | A | no | |
COMP | Feature | Continuous | no |
0 to 10 of 17
Additional Variable Information
Attribute Information: The dataset consists of 1516948data points collected at 0.1Hz from February to August 2020 and is described by 15 features from 7 analogue (1-7) and 8 digital (8-15) sensors: 1. TP2 (bar) – the measure of the pressure on the compressor. 2. TP3 (bar) – the measure of the pressure generated at the pneumatic panel. 3. H1 (bar) – the measure of the pressure generated due to pressure drop when the discharge of the cyclonic separator filter occurs. 4. DV pressure (bar) – the measure of the pressure drop generated when the towers discharge air dryers; a zero reading indicates that the compressor is operating under load. 5. Reservoirs (bar) – the measure of the downstream pressure of the reservoirs, which should be close to the pneumatic panel pressure (TP3). 6. Motor Current (A) – the measure of the current of one phase of the three-phase motor; it presents values close to 0A - when it turns off, 4A - when working offloaded, 7A - when working under load, and 9A - when it starts working. 7. Oil Temperature (ºC) – the measure of the oil temperature on the compressor. 8. COMP - the electrical signal of the air intake valve on the compressor; it is active when there is no air intake, indicating that the compressor is either turned off or operating in an offloaded state. 9. DV electric – the electrical signal that controls the compressor outlet valve; it is active when the compressor is functioning under load and inactive when the compressor is either off or operating in an offloaded state. 10. TOWERS – the electrical signal that defines the tower responsible for drying the air and the tower responsible for draining the humidity removed from the air; when not active, it indicates that tower one is functioning; when active, it indicates that tower two is in operation. 11. MPG – the electrical signal responsible for starting the compressor under load by activating the intake valve when the pressure in the air production unit (APU) falls below 8.2 bar; it activates the COMP sensor, which assumes the same behaviour as the MPG sensor. 12. LPS – the electrical signal that detects and activates when the pressure drops below 7 bars. 13. Pressure Switch - the electrical signal that detects the discharge in the air-drying towers. 14. Oil Level – the electrical signal that detects the oil level on the compressor; it is active when the oil is below the expected values. 15. Caudal Impulse – the electrical signal that counts the pulse outputs generated by the absolute amount of air flowing from the APU to the reservoirs.
Dataset Files
File | Size |
---|---|
MetroPT3(AirCompressor).csv | 208.2 MB |
Data Description_Metro.pdf | 79.3 KB |
Reviews
There are no reviews for this dataset yet.
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo # fetch dataset metropt_3_dataset = fetch_ucirepo(id=791) # data (as pandas dataframes) X = metropt_3_dataset.data.features y = metropt_3_dataset.data.targets # metadata print(metropt_3_dataset.metadata) # variable information print(metropt_3_dataset.variables)
Davari, N., Veloso, B., Ribeiro , R., & Gama, J. (2021). MetroPT-3 Dataset [Dataset]. UCI Machine Learning Repository. https://doi.org/10.24432/C5VW3R.
Creators
Narjes Davari
narjes.davari@inesctec.pt
Institute for Systems and Computer Engineering, Technology and Science (INESC TEC)
Bruno Veloso
bveloso@fep.up.pt
INESC TEC - Laboratory of Artificial Intelligence and Decision Support, and Faculty of Economics, University of Porto, Portugal
Rita Ribeiro
rpribeiro@fc.up.pt
INESC TEC-Laboratory of Artificial Intelligence and Decision Support, and Faculty of Sciences, University of Porto, Portugal
Joao Gama
jgama@fep.up.pt
INESC TEC - Laboratory of Artificial Intelligence and Decision Support, and Faculty of Economics, University of Porto, Portugal.
DOI
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.