Automated Feature Engineering for Deep Neural Networks with Genetic Programming

Feature engineering is a process that augments the feature vector of a machine learning model with calculated values that are designed to enhance the accuracy of a model’s predictions. Research has shown that the accuracy of models such as deep neural networks, support vector machines, and tree/fore...

Full description

Bibliographic Details
Main Author: Heaton, Jeff T.
Format: Others
Published: NSUWorks 2017
Subjects:
Online Access:http://nsuworks.nova.edu/gscis_etd/994
http://nsuworks.nova.edu/cgi/viewcontent.cgi?article=1995&context=gscis_etd
id ndltd-nova.edu-oai-nsuworks.nova.edu-gscis_etd-1995
record_format oai_dc
spelling ndltd-nova.edu-oai-nsuworks.nova.edu-gscis_etd-19952017-11-29T04:21:04Z Automated Feature Engineering for Deep Neural Networks with Genetic Programming Heaton, Jeff T. Feature engineering is a process that augments the feature vector of a machine learning model with calculated values that are designed to enhance the accuracy of a model’s predictions. Research has shown that the accuracy of models such as deep neural networks, support vector machines, and tree/forest-based algorithms sometimes benefit from feature engineering. Expressions that combine one or more of the original features usually create these engineered features. The choice of the exact structure of an engineered feature is dependent on the type of machine learning model in use. Previous research demonstrated that various model families benefit from different types of engineered feature. Random forests, gradient-boosting machines, or other tree-based models might not see the same accuracy gain that an engineered feature allowed neural networks, generalized linear models, or other dot-product based models to achieve on the same data set. This dissertation presents a genetic programming-based algorithm that automatically engineers features that increase the accuracy of deep neural networks for some data sets. For a genetic programming algorithm to be effective, it must prioritize the search space and efficiently evaluate what it finds. This dissertation algorithm faced a potential search space composed of all possible mathematical combinations of the original feature vector. Five experiments were designed to guide the search process to efficiently evolve good engineered features. The result of this dissertation is an automated feature engineering (AFE) algorithm that is computationally efficient, even though a neural network is used to evaluate each candidate feature. This approach gave the algorithm a greater opportunity to specifically target deep neural networks in its search for engineered features that improve accuracy. Finally, a sixth experiment empirically demonstrated the degree to which this algorithm improved the accuracy of neural networks on data sets augmented by the algorithm’s engineered features. 2017-01-01T08:00:00Z text application/pdf http://nsuworks.nova.edu/gscis_etd/994 http://nsuworks.nova.edu/cgi/viewcontent.cgi?article=1995&context=gscis_etd CEC Theses and Dissertations NSUWorks deep neural network feature engineering genetic programming Computer Sciences
collection NDLTD
format Others
sources NDLTD
topic deep neural network
feature engineering
genetic programming
Computer Sciences
spellingShingle deep neural network
feature engineering
genetic programming
Computer Sciences
Heaton, Jeff T.
Automated Feature Engineering for Deep Neural Networks with Genetic Programming
description Feature engineering is a process that augments the feature vector of a machine learning model with calculated values that are designed to enhance the accuracy of a model’s predictions. Research has shown that the accuracy of models such as deep neural networks, support vector machines, and tree/forest-based algorithms sometimes benefit from feature engineering. Expressions that combine one or more of the original features usually create these engineered features. The choice of the exact structure of an engineered feature is dependent on the type of machine learning model in use. Previous research demonstrated that various model families benefit from different types of engineered feature. Random forests, gradient-boosting machines, or other tree-based models might not see the same accuracy gain that an engineered feature allowed neural networks, generalized linear models, or other dot-product based models to achieve on the same data set. This dissertation presents a genetic programming-based algorithm that automatically engineers features that increase the accuracy of deep neural networks for some data sets. For a genetic programming algorithm to be effective, it must prioritize the search space and efficiently evaluate what it finds. This dissertation algorithm faced a potential search space composed of all possible mathematical combinations of the original feature vector. Five experiments were designed to guide the search process to efficiently evolve good engineered features. The result of this dissertation is an automated feature engineering (AFE) algorithm that is computationally efficient, even though a neural network is used to evaluate each candidate feature. This approach gave the algorithm a greater opportunity to specifically target deep neural networks in its search for engineered features that improve accuracy. Finally, a sixth experiment empirically demonstrated the degree to which this algorithm improved the accuracy of neural networks on data sets augmented by the algorithm’s engineered features.
author Heaton, Jeff T.
author_facet Heaton, Jeff T.
author_sort Heaton, Jeff T.
title Automated Feature Engineering for Deep Neural Networks with Genetic Programming
title_short Automated Feature Engineering for Deep Neural Networks with Genetic Programming
title_full Automated Feature Engineering for Deep Neural Networks with Genetic Programming
title_fullStr Automated Feature Engineering for Deep Neural Networks with Genetic Programming
title_full_unstemmed Automated Feature Engineering for Deep Neural Networks with Genetic Programming
title_sort automated feature engineering for deep neural networks with genetic programming
publisher NSUWorks
publishDate 2017
url http://nsuworks.nova.edu/gscis_etd/994
http://nsuworks.nova.edu/cgi/viewcontent.cgi?article=1995&context=gscis_etd
work_keys_str_mv AT heatonjefft automatedfeatureengineeringfordeepneuralnetworkswithgeneticprogramming
_version_ 1718563158486941696