An Integrated Approach for Massive Sequential Data Processing in Civil Infrastructure Operation and Maintenance

This paper presents an extract-transform-load (ETL) approach based on multilayer task execution for processing massive sequential data collected from infrastructure operation and maintenance. The proposed approach consists of ETL task partition, execution mode selection, and ETL modeling. The task p...

Full description

Bibliographic Details
Main Authors: Gang Yu, Jiajun Liu, Juan Du, Min Hu, Vijayan Sugumaran
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8316849/
id doaj-5da6c77429154e12bc2a0d51f7551f11
record_format Article
spelling doaj-5da6c77429154e12bc2a0d51f7551f112021-03-29T20:52:55ZengIEEEIEEE Access2169-35362018-01-016197391975110.1109/ACCESS.2018.28160018316849An Integrated Approach for Massive Sequential Data Processing in Civil Infrastructure Operation and MaintenanceGang Yu0Jiajun Liu1Juan Du2https://orcid.org/0000-0002-6428-7763Min Hu3Vijayan Sugumaran4SHU-UTS SILC Business School, Shanghai University, Shanghai, ChinaSHU-UTS SILC Business School, Shanghai University, Shanghai, ChinaSHU-UTS SILC Business School, Shanghai University, Shanghai, ChinaSHU-UTS SILC Business School, Shanghai University, Shanghai, ChinaDepartment of Decision and Information Sciences, Oakland University, Rochester, MI, USAThis paper presents an extract-transform-load (ETL) approach based on multilayer task execution for processing massive sequential data collected from infrastructure operation and maintenance. The proposed approach consists of ETL task partition, execution mode selection, and ETL modeling. The task partition focuses on dividing the ETL process into four tasks to be executed in accordance with different organizational forms of data. Sequenced or non-sequenced load mode is optional, which is independent of the data standardization. In addition, the ETL modeling phase implements conceptual, logical, and physical modeling for the multi-dimensional model. Our main objective is to integrate massive sequential data, enhancing decision-making performance for the intelligent management platform. Traffic data for two years were collected from various systems and acquisition tools of different providers to evaluate the data integration capability of the proposed approach. Furthermore, Kettle software was used to perform transformation and job modules for the multilayer tasks. In addition, a machine learning algorithm was used to generate traffic warning in the tunnels based on the integrated data. The proposed approach is promising for the management and analysis of massive sequential data generated in operation, the maintenance of transportation tunnels, and effective decision-making.https://ieeexplore.ieee.org/document/8316849/Civil infrastructuremassive data integrationsequential analysismaintenance and operation management
collection DOAJ
language English
format Article
sources DOAJ
author Gang Yu
Jiajun Liu
Juan Du
Min Hu
Vijayan Sugumaran
spellingShingle Gang Yu
Jiajun Liu
Juan Du
Min Hu
Vijayan Sugumaran
An Integrated Approach for Massive Sequential Data Processing in Civil Infrastructure Operation and Maintenance
IEEE Access
Civil infrastructure
massive data integration
sequential analysis
maintenance and operation management
author_facet Gang Yu
Jiajun Liu
Juan Du
Min Hu
Vijayan Sugumaran
author_sort Gang Yu
title An Integrated Approach for Massive Sequential Data Processing in Civil Infrastructure Operation and Maintenance
title_short An Integrated Approach for Massive Sequential Data Processing in Civil Infrastructure Operation and Maintenance
title_full An Integrated Approach for Massive Sequential Data Processing in Civil Infrastructure Operation and Maintenance
title_fullStr An Integrated Approach for Massive Sequential Data Processing in Civil Infrastructure Operation and Maintenance
title_full_unstemmed An Integrated Approach for Massive Sequential Data Processing in Civil Infrastructure Operation and Maintenance
title_sort integrated approach for massive sequential data processing in civil infrastructure operation and maintenance
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2018-01-01
description This paper presents an extract-transform-load (ETL) approach based on multilayer task execution for processing massive sequential data collected from infrastructure operation and maintenance. The proposed approach consists of ETL task partition, execution mode selection, and ETL modeling. The task partition focuses on dividing the ETL process into four tasks to be executed in accordance with different organizational forms of data. Sequenced or non-sequenced load mode is optional, which is independent of the data standardization. In addition, the ETL modeling phase implements conceptual, logical, and physical modeling for the multi-dimensional model. Our main objective is to integrate massive sequential data, enhancing decision-making performance for the intelligent management platform. Traffic data for two years were collected from various systems and acquisition tools of different providers to evaluate the data integration capability of the proposed approach. Furthermore, Kettle software was used to perform transformation and job modules for the multilayer tasks. In addition, a machine learning algorithm was used to generate traffic warning in the tunnels based on the integrated data. The proposed approach is promising for the management and analysis of massive sequential data generated in operation, the maintenance of transportation tunnels, and effective decision-making.
topic Civil infrastructure
massive data integration
sequential analysis
maintenance and operation management
url https://ieeexplore.ieee.org/document/8316849/
work_keys_str_mv AT gangyu anintegratedapproachformassivesequentialdataprocessingincivilinfrastructureoperationandmaintenance
AT jiajunliu anintegratedapproachformassivesequentialdataprocessingincivilinfrastructureoperationandmaintenance
AT juandu anintegratedapproachformassivesequentialdataprocessingincivilinfrastructureoperationandmaintenance
AT minhu anintegratedapproachformassivesequentialdataprocessingincivilinfrastructureoperationandmaintenance
AT vijayansugumaran anintegratedapproachformassivesequentialdataprocessingincivilinfrastructureoperationandmaintenance
AT gangyu integratedapproachformassivesequentialdataprocessingincivilinfrastructureoperationandmaintenance
AT jiajunliu integratedapproachformassivesequentialdataprocessingincivilinfrastructureoperationandmaintenance
AT juandu integratedapproachformassivesequentialdataprocessingincivilinfrastructureoperationandmaintenance
AT minhu integratedapproachformassivesequentialdataprocessingincivilinfrastructureoperationandmaintenance
AT vijayansugumaran integratedapproachformassivesequentialdataprocessingincivilinfrastructureoperationandmaintenance
_version_ 1724193939912654848