Neural Rule Ensembles: Encoding Feature Interactions into Neural Networks

Artificial Neural Networks form the basis of very powerful learning methods. It has been observed that a naive application of fully connected neural networks often leads to overfitting. In an attempt to circumvent this issue, a prior knowledge pertaining to feature interactions can be encoded into t...

Full description

Bibliographic Details
Other Authors: Dawer, Gitesh (author)
Format: Others
Language:English
English
Published: Florida State University
Subjects:
Online Access:http://purl.flvc.org/fsu/fd/2018_Su_Dawer_fsu_0071E_14670
id ndltd-fsu.edu-oai-fsu.digital.flvc.org-fsu_647214
record_format oai_dc
spelling ndltd-fsu.edu-oai-fsu.digital.flvc.org-fsu_6472142019-07-01T05:19:02Z Neural Rule Ensembles: Encoding Feature Interactions into Neural Networks Dawer, Gitesh (author) Barbu, Adrian G., 1971- (professor co-directing dissertation) Gallivan, Kyle A., 1958- (professor co-directing dissertation) Erlebacher, Gordon, 1957- (university representative) Ökten, Giray (committee member) Sussman, Mark (committee member) Florida State University (degree granting institution) College of Arts and Sciences (degree granting college) Department of Mathematics (degree granting departmentdgg) Text text doctoral thesis Florida State University English eng 1 online resource (66 pages) computer application/pdf Artificial Neural Networks form the basis of very powerful learning methods. It has been observed that a naive application of fully connected neural networks often leads to overfitting. In an attempt to circumvent this issue, a prior knowledge pertaining to feature interactions can be encoded into these networks. This defines a task-specific structure on an underlying representation and helps in reducing the number of learnable parameters. Convolutional Neural Network is such an adaptation of artificial neural networks for image datasets which exploits the spatial relationship among the features and explicitly encodes the translational equivariance. Similarly, Recurrent Neural Networks are designed to exploit the temporal relationship inherent in sequential data. However, for tabular datasets, any prior structure on feature relationships is not apparent. In this work, we use decision trees to capture such feature interactions for this kind of datasets and define a mapping to encode extracted relationships into a neural network. This addresses the initialization related concerns of fully connected neural networks and enables learning of compact representations compared to state of the art tree-based approaches. Empirical evaluations and simulation studies show the superiority of such an approach over fully connected neural networks and tree-based approaches. A Dissertation submitted to the Department of Mathematics in partial fulfillment of the requirements for the degree of Doctor of Philosophy. Summer Semester 2018. June 20, 2018. Decision Trees, Ensemble Methods, Neural Networks, Neural Rule Ensembles, Rule Learning, Supervised Learning Includes bibliographical references. Adrian Barbu, Professor Co-Directing Dissertation; Kyle Gallivan, Professor Co-Directing Dissertation; Gordon Erlebacher, University Representative; Giray Okten, Committee Member; Mark Sussman, Committee Member. Applied mathematics Computer science Statistics 2018_Su_Dawer_fsu_0071E_14670 http://purl.flvc.org/fsu/fd/2018_Su_Dawer_fsu_0071E_14670 http://diginole.lib.fsu.edu/islandora/object/fsu%3A647214/datastream/TN/view/Neural%20Rule%20Ensembles.jpg
collection NDLTD
language English
English
format Others
sources NDLTD
topic Applied mathematics
Computer science
Statistics
spellingShingle Applied mathematics
Computer science
Statistics
Neural Rule Ensembles: Encoding Feature Interactions into Neural Networks
description Artificial Neural Networks form the basis of very powerful learning methods. It has been observed that a naive application of fully connected neural networks often leads to overfitting. In an attempt to circumvent this issue, a prior knowledge pertaining to feature interactions can be encoded into these networks. This defines a task-specific structure on an underlying representation and helps in reducing the number of learnable parameters. Convolutional Neural Network is such an adaptation of artificial neural networks for image datasets which exploits the spatial relationship among the features and explicitly encodes the translational equivariance. Similarly, Recurrent Neural Networks are designed to exploit the temporal relationship inherent in sequential data. However, for tabular datasets, any prior structure on feature relationships is not apparent. In this work, we use decision trees to capture such feature interactions for this kind of datasets and define a mapping to encode extracted relationships into a neural network. This addresses the initialization related concerns of fully connected neural networks and enables learning of compact representations compared to state of the art tree-based approaches. Empirical evaluations and simulation studies show the superiority of such an approach over fully connected neural networks and tree-based approaches. === A Dissertation submitted to the Department of Mathematics in partial fulfillment of the requirements for the degree of Doctor of Philosophy. === Summer Semester 2018. === June 20, 2018. === Decision Trees, Ensemble Methods, Neural Networks, Neural Rule Ensembles, Rule Learning, Supervised Learning === Includes bibliographical references. === Adrian Barbu, Professor Co-Directing Dissertation; Kyle Gallivan, Professor Co-Directing Dissertation; Gordon Erlebacher, University Representative; Giray Okten, Committee Member; Mark Sussman, Committee Member.
author2 Dawer, Gitesh (author)
author_facet Dawer, Gitesh (author)
title Neural Rule Ensembles: Encoding Feature Interactions into Neural Networks
title_short Neural Rule Ensembles: Encoding Feature Interactions into Neural Networks
title_full Neural Rule Ensembles: Encoding Feature Interactions into Neural Networks
title_fullStr Neural Rule Ensembles: Encoding Feature Interactions into Neural Networks
title_full_unstemmed Neural Rule Ensembles: Encoding Feature Interactions into Neural Networks
title_sort neural rule ensembles: encoding feature interactions into neural networks
publisher Florida State University
url http://purl.flvc.org/fsu/fd/2018_Su_Dawer_fsu_0071E_14670
_version_ 1719218018364424192