A New Algorithm for Learning Large Bayesian Network Structure From Discrete Data
Learning the structure of Bayesian networks (BNs) from high dimensional discrete data is common nowadays but a challenging task, due to the large parameter space, the acyclicity constraint placed on the graphical structures and the difficulty in searching for a sparse structure. In this article, we...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8813012/ |
id |
doaj-e1891b5706224e7b9a1c57f6d54f4ba7 |
---|---|
record_format |
Article |
spelling |
doaj-e1891b5706224e7b9a1c57f6d54f4ba72021-03-29T23:24:46ZengIEEEIEEE Access2169-35362019-01-01712166512167410.1109/ACCESS.2019.29375818813012A New Algorithm for Learning Large Bayesian Network Structure From Discrete DataWeiping Zhang0https://orcid.org/0000-0003-0118-5097Ziqiang Xu1Yu Chen2https://orcid.org/0000-0002-2438-3451Jing Yang3https://orcid.org/0000-0003-3922-299XSchool of Management, School of Data Science, University of Science and Technology of China, Hefei, ChinaSchool of Management, School of Data Science, University of Science and Technology of China, Hefei, ChinaSchool of Management, School of Data Science, University of Science and Technology of China, Hefei, ChinaDepartment of Computer Science and Technology, Hefei University of Technology, Hefei, ChinaLearning the structure of Bayesian networks (BNs) from high dimensional discrete data is common nowadays but a challenging task, due to the large parameter space, the acyclicity constraint placed on the graphical structures and the difficulty in searching for a sparse structure. In this article, we propose a sparse structure learning algorithm (SSLA) to solve this problem. The algorithm uses the negative log-likelihood function of multi-logit regression as a loss function, adding the adaptive group lasso as a penalty term for sparsity, with a new penalty term to ensure that the learned graph is a directed acyclic graph. A block coordinate descent algorithm (BCD) combining with the alternating direction multiplier method (ADMM) algorithm is developed to solve the proposed model. The learned graph is proved theoretically to be a Bayesian network. In order to evaluate the proposed SSLA and compare with its competitors, we conducted intensive simulation studies and applied them to the benchmark Bayesian networks. The results indicate that the SSLA is superior to the hill climbing (HC) algorithm, the CD algorithm and the BFO-B algorithm respectively, and is competitive with K2 algorithm when the order of the nodes is given.https://ieeexplore.ieee.org/document/8813012/Network theoryregression analysisBayesian methodsadaptive group lassomulti-logit regressionstructure learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Weiping Zhang Ziqiang Xu Yu Chen Jing Yang |
spellingShingle |
Weiping Zhang Ziqiang Xu Yu Chen Jing Yang A New Algorithm for Learning Large Bayesian Network Structure From Discrete Data IEEE Access Network theory regression analysis Bayesian methods adaptive group lasso multi-logit regression structure learning |
author_facet |
Weiping Zhang Ziqiang Xu Yu Chen Jing Yang |
author_sort |
Weiping Zhang |
title |
A New Algorithm for Learning Large Bayesian Network Structure From Discrete Data |
title_short |
A New Algorithm for Learning Large Bayesian Network Structure From Discrete Data |
title_full |
A New Algorithm for Learning Large Bayesian Network Structure From Discrete Data |
title_fullStr |
A New Algorithm for Learning Large Bayesian Network Structure From Discrete Data |
title_full_unstemmed |
A New Algorithm for Learning Large Bayesian Network Structure From Discrete Data |
title_sort |
new algorithm for learning large bayesian network structure from discrete data |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
Learning the structure of Bayesian networks (BNs) from high dimensional discrete data is common nowadays but a challenging task, due to the large parameter space, the acyclicity constraint placed on the graphical structures and the difficulty in searching for a sparse structure. In this article, we propose a sparse structure learning algorithm (SSLA) to solve this problem. The algorithm uses the negative log-likelihood function of multi-logit regression as a loss function, adding the adaptive group lasso as a penalty term for sparsity, with a new penalty term to ensure that the learned graph is a directed acyclic graph. A block coordinate descent algorithm (BCD) combining with the alternating direction multiplier method (ADMM) algorithm is developed to solve the proposed model. The learned graph is proved theoretically to be a Bayesian network. In order to evaluate the proposed SSLA and compare with its competitors, we conducted intensive simulation studies and applied them to the benchmark Bayesian networks. The results indicate that the SSLA is superior to the hill climbing (HC) algorithm, the CD algorithm and the BFO-B algorithm respectively, and is competitive with K2 algorithm when the order of the nodes is given. |
topic |
Network theory regression analysis Bayesian methods adaptive group lasso multi-logit regression structure learning |
url |
https://ieeexplore.ieee.org/document/8813012/ |
work_keys_str_mv |
AT weipingzhang anewalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata AT ziqiangxu anewalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata AT yuchen anewalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata AT jingyang anewalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata AT weipingzhang newalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata AT ziqiangxu newalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata AT yuchen newalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata AT jingyang newalgorithmforlearninglargebayesiannetworkstructurefromdiscretedata |
_version_ |
1724189599825133568 |