Dirty engineering data-driven inverse prediction machine learning model

Abstract Most data-driven machine learning (ML) approaches established in metallurgy research fields are focused on a build-up of reliable quantitative models that predict a material property from a given set of material conditions. In general, the input feature dimension (the number of material con...

Full description

Bibliographic Details
Main Authors: Jin-Woong Lee, Woon Bae Park, Byung Do Lee, Seonghwan Kim, Nam Hoon Goo, Kee-Sun Sohn
Format: Article
Language:English
Published: Nature Publishing Group 2020-11-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-020-77575-0
id doaj-97f45af18b5640c5a76a5470b7e82b31
record_format Article
spelling doaj-97f45af18b5640c5a76a5470b7e82b312020-12-08T12:46:47ZengNature Publishing GroupScientific Reports2045-23222020-11-0110111410.1038/s41598-020-77575-0Dirty engineering data-driven inverse prediction machine learning modelJin-Woong Lee0Woon Bae Park1Byung Do Lee2Seonghwan Kim3Nam Hoon Goo4Kee-Sun Sohn5Faculty of Nanotechnology and Advanced Materials Engineering, Sejong UniversityDepartment of Printed Electronics, Sunchon National UniversityFaculty of Nanotechnology and Advanced Materials Engineering, Sejong UniversityAdvanced Research Team, Hyundai Steel DangJin WorksAdvanced Research Team, Hyundai Steel DangJin WorksFaculty of Nanotechnology and Advanced Materials Engineering, Sejong UniversityAbstract Most data-driven machine learning (ML) approaches established in metallurgy research fields are focused on a build-up of reliable quantitative models that predict a material property from a given set of material conditions. In general, the input feature dimension (the number of material condition variables) is much higher than the output feature dimension (the number of material properties of concern). Rather than such a forward-prediction ML model, it is necessary to develop so-called inverse-design modeling, wherein required material conditions could be deduced from a set of desired material properties. Here we report a novel inverse design strategy that employs two independent approaches: a metaheuristics-assisted inverse reading of conventional forward ML models and an atypical inverse ML model based on a modified variational autoencoder. These two unprecedented approaches were successful and led to overlapped results, from which we pinpointed several novel thermo-mechanically controlled processed (TMCP) steel alloy candidates that were validated by a rule-based thermodynamic calculation tool (Thermo-Calc.). We also suggested a practical protocol to elucidate how to treat engineering data collected from industry, which is not prepared as independent and identically distributed (IID) random data.https://doi.org/10.1038/s41598-020-77575-0
collection DOAJ
language English
format Article
sources DOAJ
author Jin-Woong Lee
Woon Bae Park
Byung Do Lee
Seonghwan Kim
Nam Hoon Goo
Kee-Sun Sohn
spellingShingle Jin-Woong Lee
Woon Bae Park
Byung Do Lee
Seonghwan Kim
Nam Hoon Goo
Kee-Sun Sohn
Dirty engineering data-driven inverse prediction machine learning model
Scientific Reports
author_facet Jin-Woong Lee
Woon Bae Park
Byung Do Lee
Seonghwan Kim
Nam Hoon Goo
Kee-Sun Sohn
author_sort Jin-Woong Lee
title Dirty engineering data-driven inverse prediction machine learning model
title_short Dirty engineering data-driven inverse prediction machine learning model
title_full Dirty engineering data-driven inverse prediction machine learning model
title_fullStr Dirty engineering data-driven inverse prediction machine learning model
title_full_unstemmed Dirty engineering data-driven inverse prediction machine learning model
title_sort dirty engineering data-driven inverse prediction machine learning model
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2020-11-01
description Abstract Most data-driven machine learning (ML) approaches established in metallurgy research fields are focused on a build-up of reliable quantitative models that predict a material property from a given set of material conditions. In general, the input feature dimension (the number of material condition variables) is much higher than the output feature dimension (the number of material properties of concern). Rather than such a forward-prediction ML model, it is necessary to develop so-called inverse-design modeling, wherein required material conditions could be deduced from a set of desired material properties. Here we report a novel inverse design strategy that employs two independent approaches: a metaheuristics-assisted inverse reading of conventional forward ML models and an atypical inverse ML model based on a modified variational autoencoder. These two unprecedented approaches were successful and led to overlapped results, from which we pinpointed several novel thermo-mechanically controlled processed (TMCP) steel alloy candidates that were validated by a rule-based thermodynamic calculation tool (Thermo-Calc.). We also suggested a practical protocol to elucidate how to treat engineering data collected from industry, which is not prepared as independent and identically distributed (IID) random data.
url https://doi.org/10.1038/s41598-020-77575-0
work_keys_str_mv AT jinwoonglee dirtyengineeringdatadriveninversepredictionmachinelearningmodel
AT woonbaepark dirtyengineeringdatadriveninversepredictionmachinelearningmodel
AT byungdolee dirtyengineeringdatadriveninversepredictionmachinelearningmodel
AT seonghwankim dirtyengineeringdatadriveninversepredictionmachinelearningmodel
AT namhoongoo dirtyengineeringdatadriveninversepredictionmachinelearningmodel
AT keesunsohn dirtyengineeringdatadriveninversepredictionmachinelearningmodel
_version_ 1724389372520824832