A New Algorithm for Learning Large Bayesian Network Structure From Discrete Data

Learning the structure of Bayesian networks (BNs) from high dimensional discrete data is common nowadays but a challenging task, due to the large parameter space, the acyclicity constraint placed on the graphical structures and the difficulty in searching for a sparse structure. In this article, we...

Full description

Bibliographic Details
Main Authors: Weiping Zhang, Ziqiang Xu, Yu Chen, Jing Yang
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8813012/
id doaj-e1891b5706224e7b9a1c57f6d54f4ba7
record_format Article
spelling doaj-e1891b5706224e7b9a1c57f6d54f4ba72021-03-29T23:24:46ZengIEEEIEEE Access2169-35362019-01-01712166512167410.1109/ACCESS.2019.29375818813012A New Algorithm for Learning Large Bayesian Network Structure From Discrete DataWeiping Zhang0https://orcid.org/0000-0003-0118-5097Ziqiang Xu1Yu Chen2https://orcid.org/0000-0002-2438-3451Jing Yang3https://orcid.org/0000-0003-3922-299XSchool of Management, School of Data Science, University of Science and Technology of China, Hefei, ChinaSchool of Management, School of Data Science, University of Science and Technology of China, Hefei, ChinaSchool of Management, School of Data Science, University of Science and Technology of China, Hefei, ChinaDepartment of Computer Science and Technology, Hefei University of Technology, Hefei, ChinaLearning the structure of Bayesian networks (BNs) from high dimensional discrete data is common nowadays but a challenging task, due to the large parameter space, the acyclicity constraint placed on the graphical structures and the difficulty in searching for a sparse structure. In this article, we propose a sparse structure learning algorithm (SSLA) to solve this problem. The algorithm uses the negative log-likelihood function of multi-logit regression as a loss function, adding the adaptive group lasso as a penalty term for sparsity, with a new penalty term to ensure that the learned graph is a directed acyclic graph. A block coordinate descent algorithm (BCD) combining with the alternating direction multiplier method (ADMM) algorithm is developed to solve the proposed model. The learned graph is proved theoretically to be a Bayesian network. In order to evaluate the proposed SSLA and compare with its competitors, we conducted intensive simulation studies and applied them to the benchmark Bayesian networks. The results indicate that the SSLA is superior to the hill climbing (HC) algorithm, the CD algorithm and the BFO-B algorithm respectively, and is competitive with K2 algorithm when the order of the nodes is given.https://ieeexplore.ieee.org/document/8813012/Network theoryregression analysisBayesian methodsadaptive group lassomulti-logit regressionstructure learning
collection DOAJ
language English
format Article
sources DOAJ
author Weiping Zhang
Ziqiang Xu
Yu Chen
Jing Yang
spellingShingle Weiping Zhang
Ziqiang Xu
Yu Chen
Jing Yang
A New Algorithm for Learning Large Bayesian Network Structure From Discrete Data
IEEE Access
Network theory
regression analysis
Bayesian methods
adaptive group lasso
multi-logit regression
structure learning
author_facet Weiping Zhang
Ziqiang Xu
Yu Chen
Jing Yang
author_sort Weiping Zhang
title A New Algorithm for Learning Large Bayesian Network Structure From Discrete Data
title_short A New Algorithm for Learning Large Bayesian Network Structure From Discrete Data
title_full A New Algorithm for Learning Large Bayesian Network Structure From Discrete Data
title_fullStr A New Algorithm for Learning Large Bayesian Network Structure From Discrete Data
title_full_unstemmed A New Algorithm for Learning Large Bayesian Network Structure From Discrete Data
title_sort new algorithm for learning large bayesian network structure from discrete data
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Learning the structure of Bayesian networks (BNs) from high dimensional discrete data is common nowadays but a challenging task, due to the large parameter space, the acyclicity constraint placed on the graphical structures and the difficulty in searching for a sparse structure. In this article, we propose a sparse structure learning algorithm (SSLA) to solve this problem. The algorithm uses the negative log-likelihood function of multi-logit regression as a loss function, adding the adaptive group lasso as a penalty term for sparsity, with a new penalty term to ensure that the learned graph is a directed acyclic graph. A block coordinate descent algorithm (BCD) combining with the alternating direction multiplier method (ADMM) algorithm is developed to solve the proposed model. The learned graph is proved theoretically to be a Bayesian network. In order to evaluate the proposed SSLA and compare with its competitors, we conducted intensive simulation studies and applied them to the benchmark Bayesian networks. The results indicate that the SSLA is superior to the hill climbing (HC) algorithm, the CD algorithm and the BFO-B algorithm respectively, and is competitive with K2 algorithm when the order of the nodes is given.
topic Network theory
regression analysis
Bayesian methods
adaptive group lasso
multi-logit regression
structure learning
url https://ieeexplore.ieee.org/document/8813012/
work_keys_str_mv AT weipingzhang anewalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata
AT ziqiangxu anewalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata
AT yuchen anewalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata
AT jingyang anewalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata
AT weipingzhang newalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata
AT ziqiangxu newalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata
AT yuchen newalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata
AT jingyang newalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata
_version_ 1724189599825133568