An Imbalanced Data Handling Framework for Industrial Big Data Using a Gaussian Process Regression-Based Generative Adversarial Network

The developments in the fields of industrial Internet of Things (IIoT) and big data technologies have made it possible to collect a lot of meaningful industrial process and quality-based data. The gathered data are analyzed using contemporary statistical methods and machine learning techniques. Then...

Full description

Bibliographic Details
Main Authors: Eunseo Oh, Hyunsoo Lee
Format: Article
Language:English
Published: MDPI AG 2020-04-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/12/4/669
id doaj-70e14801d54d4432bc9317e8716de315
record_format Article
spelling doaj-70e14801d54d4432bc9317e8716de3152020-11-25T03:13:19ZengMDPI AGSymmetry2073-89942020-04-011266966910.3390/sym12040669An Imbalanced Data Handling Framework for Industrial Big Data Using a Gaussian Process Regression-Based Generative Adversarial NetworkEunseo Oh0Hyunsoo Lee1School of Industrial Engineering, Kumoh National Institute of Technology, Gumi 39177, KoreaSchool of Industrial Engineering, Kumoh National Institute of Technology, Gumi 39177, KoreaThe developments in the fields of industrial Internet of Things (IIoT) and big data technologies have made it possible to collect a lot of meaningful industrial process and quality-based data. The gathered data are analyzed using contemporary statistical methods and machine learning techniques. Then, the extracted knowledge can be used for predictive maintenance or prognostic health management. However, it is difficult to gather complete data due to several issues in IIoT, such as devices breaking down, running out of battery, or undergoing scheduled maintenance. Data with missing values are often ignored, as they may contain insufficient information from which to draw conclusions. In order to overcome these issues, we propose a novel, effective missing data handling mechanism for the concepts of symmetry principles. While other existing methods only attempt to estimate missing parts, the proposed method generates a whole set of data set using Gaussian process regression and a generative adversarial network. In order to prove the effectiveness of the proposed framework, we examine a real-world, industrial case involving an air pressure system (APS), where we use the proposed method to make quality predictions and compare the results with existing state-of-the-art estimation methods.https://www.mdpi.com/2073-8994/12/4/669missing data generationindustrial big dataair pressure systemgenerative adversarial networkGaussian process regression
collection DOAJ
language English
format Article
sources DOAJ
author Eunseo Oh
Hyunsoo Lee
spellingShingle Eunseo Oh
Hyunsoo Lee
An Imbalanced Data Handling Framework for Industrial Big Data Using a Gaussian Process Regression-Based Generative Adversarial Network
Symmetry
missing data generation
industrial big data
air pressure system
generative adversarial network
Gaussian process regression
author_facet Eunseo Oh
Hyunsoo Lee
author_sort Eunseo Oh
title An Imbalanced Data Handling Framework for Industrial Big Data Using a Gaussian Process Regression-Based Generative Adversarial Network
title_short An Imbalanced Data Handling Framework for Industrial Big Data Using a Gaussian Process Regression-Based Generative Adversarial Network
title_full An Imbalanced Data Handling Framework for Industrial Big Data Using a Gaussian Process Regression-Based Generative Adversarial Network
title_fullStr An Imbalanced Data Handling Framework for Industrial Big Data Using a Gaussian Process Regression-Based Generative Adversarial Network
title_full_unstemmed An Imbalanced Data Handling Framework for Industrial Big Data Using a Gaussian Process Regression-Based Generative Adversarial Network
title_sort imbalanced data handling framework for industrial big data using a gaussian process regression-based generative adversarial network
publisher MDPI AG
series Symmetry
issn 2073-8994
publishDate 2020-04-01
description The developments in the fields of industrial Internet of Things (IIoT) and big data technologies have made it possible to collect a lot of meaningful industrial process and quality-based data. The gathered data are analyzed using contemporary statistical methods and machine learning techniques. Then, the extracted knowledge can be used for predictive maintenance or prognostic health management. However, it is difficult to gather complete data due to several issues in IIoT, such as devices breaking down, running out of battery, or undergoing scheduled maintenance. Data with missing values are often ignored, as they may contain insufficient information from which to draw conclusions. In order to overcome these issues, we propose a novel, effective missing data handling mechanism for the concepts of symmetry principles. While other existing methods only attempt to estimate missing parts, the proposed method generates a whole set of data set using Gaussian process regression and a generative adversarial network. In order to prove the effectiveness of the proposed framework, we examine a real-world, industrial case involving an air pressure system (APS), where we use the proposed method to make quality predictions and compare the results with existing state-of-the-art estimation methods.
topic missing data generation
industrial big data
air pressure system
generative adversarial network
Gaussian process regression
url https://www.mdpi.com/2073-8994/12/4/669
work_keys_str_mv AT eunseooh animbalanceddatahandlingframeworkforindustrialbigdatausingagaussianprocessregressionbasedgenerativeadversarialnetwork
AT hyunsoolee animbalanceddatahandlingframeworkforindustrialbigdatausingagaussianprocessregressionbasedgenerativeadversarialnetwork
AT eunseooh imbalanceddatahandlingframeworkforindustrialbigdatausingagaussianprocessregressionbasedgenerativeadversarialnetwork
AT hyunsoolee imbalanceddatahandlingframeworkforindustrialbigdatausingagaussianprocessregressionbasedgenerativeadversarialnetwork
_version_ 1724647489505591296