Structure Learning of Gaussian Markov Random Fields with False Discovery Rate Control

In this paper, we propose a new estimation procedure for discovering the structure of Gaussian Markov random fields (MRFs) with false discovery rate (FDR) control, making use of the sorted <inline-formula> <math display="inline"> <semantics> <msub> <mi>ℓ</m...

Full description

Bibliographic Details
Main Authors:	Sangkyun Lee, Piotr Sobczyk, Malgorzata Bogdan
Format:	Article
Language:	English
Published:	MDPI AG 2019-10-01
Series:	Symmetry
Subjects:	gaussian markov random field inverse covariance matrix estimation fdr control
Online Access:	https://www.mdpi.com/2073-8994/11/10/1311

id	doaj-3dea40bf377b4eb3b85cd2e66e3e6a95
record_format	Article
spelling	doaj-3dea40bf377b4eb3b85cd2e66e3e6a952020-11-25T02:16:00ZengMDPI AGSymmetry2073-89942019-10-011110131110.3390/sym11101311sym11101311Structure Learning of Gaussian Markov Random Fields with False Discovery Rate ControlSangkyun Lee0Piotr Sobczyk1Malgorzata Bogdan2Computer Science, Hanyang University ERICA, Ansan 15588, KoreaDepartment of Mathematics, Wroclaw University of Science and Technology, 50-370 Wroclaw, PolandInstitute of Mathematics, University of Wroclaw, 50-384 Wroclaw, PolandIn this paper, we propose a new estimation procedure for discovering the structure of Gaussian Markov random fields (MRFs) with false discovery rate (FDR) control, making use of the sorted <inline-formula> <math display="inline"> <semantics> <msub> <mi>ℓ</mi> <mn>1</mn> </msub> </semantics> </math> </inline-formula>-norm (SL1) regularization. A Gaussian MRF is an acyclic graph representing a multivariate Gaussian distribution, where nodes are random variables and edges represent the conditional dependence between the connected nodes. Since it is possible to learn the edge structure of Gaussian MRFs directly from data, Gaussian MRFs provide an excellent way to understand complex data by revealing the dependence structure among many inputs features, such as genes, sensors, users, documents, etc. In learning the graphical structure of Gaussian MRFs, it is desired to discover the actual edges of the underlying but unknown probabilistic graphical model—it becomes more complicated when the number of random variables (features) <i>p</i> increases, compared to the number of data points <i>n</i>. In particular, when <inline-formula> <math display="inline"> <semantics> <mrow> <mi>p</mi> <mo>≫</mo> <mi>n</mi> </mrow> </semantics> </math> </inline-formula>, it is statistically unavoidable for any estimation procedure to include false edges. Therefore, there have been many trials to reduce the false detection of edges, in particular, using different types of regularization on the learning parameters. Our method makes use of the SL1 regularization, introduced recently for model selection in linear regression. We focus on the benefit of SL1 regularization that it can be used to control the FDR of detecting important random variables. Adapting SL1 for probabilistic graphical models, we show that SL1 can be used for the structure learning of Gaussian MRFs using our suggested procedure nsSLOPE (neighborhood selection Sorted L-One Penalized Estimation), controlling the FDR of detecting edges.https://www.mdpi.com/2073-8994/11/10/1311gaussian markov random fieldinverse covariance matrix estimationfdr control
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Sangkyun Lee Piotr Sobczyk Malgorzata Bogdan
spellingShingle	Sangkyun Lee Piotr Sobczyk Malgorzata Bogdan Structure Learning of Gaussian Markov Random Fields with False Discovery Rate Control Symmetry gaussian markov random field inverse covariance matrix estimation fdr control
author_facet	Sangkyun Lee Piotr Sobczyk Malgorzata Bogdan
author_sort	Sangkyun Lee
title	Structure Learning of Gaussian Markov Random Fields with False Discovery Rate Control
title_short	Structure Learning of Gaussian Markov Random Fields with False Discovery Rate Control
title_full	Structure Learning of Gaussian Markov Random Fields with False Discovery Rate Control
title_fullStr	Structure Learning of Gaussian Markov Random Fields with False Discovery Rate Control
title_full_unstemmed	Structure Learning of Gaussian Markov Random Fields with False Discovery Rate Control
title_sort	structure learning of gaussian markov random fields with false discovery rate control
publisher	MDPI AG
series	Symmetry
issn	2073-8994
publishDate	2019-10-01
description	In this paper, we propose a new estimation procedure for discovering the structure of Gaussian Markov random fields (MRFs) with false discovery rate (FDR) control, making use of the sorted <inline-formula> <math display="inline"> <semantics> <msub> <mi>ℓ</mi> <mn>1</mn> </msub> </semantics> </math> </inline-formula>-norm (SL1) regularization. A Gaussian MRF is an acyclic graph representing a multivariate Gaussian distribution, where nodes are random variables and edges represent the conditional dependence between the connected nodes. Since it is possible to learn the edge structure of Gaussian MRFs directly from data, Gaussian MRFs provide an excellent way to understand complex data by revealing the dependence structure among many inputs features, such as genes, sensors, users, documents, etc. In learning the graphical structure of Gaussian MRFs, it is desired to discover the actual edges of the underlying but unknown probabilistic graphical model—it becomes more complicated when the number of random variables (features) <i>p</i> increases, compared to the number of data points <i>n</i>. In particular, when <inline-formula> <math display="inline"> <semantics> <mrow> <mi>p</mi> <mo>≫</mo> <mi>n</mi> </mrow> </semantics> </math> </inline-formula>, it is statistically unavoidable for any estimation procedure to include false edges. Therefore, there have been many trials to reduce the false detection of edges, in particular, using different types of regularization on the learning parameters. Our method makes use of the SL1 regularization, introduced recently for model selection in linear regression. We focus on the benefit of SL1 regularization that it can be used to control the FDR of detecting important random variables. Adapting SL1 for probabilistic graphical models, we show that SL1 can be used for the structure learning of Gaussian MRFs using our suggested procedure nsSLOPE (neighborhood selection Sorted L-One Penalized Estimation), controlling the FDR of detecting edges.
topic	gaussian markov random field inverse covariance matrix estimation fdr control
url	https://www.mdpi.com/2073-8994/11/10/1311
work_keys_str_mv	AT sangkyunlee structurelearningofgaussianmarkovrandomfieldswithfalsediscoveryratecontrol AT piotrsobczyk structurelearningofgaussianmarkovrandomfieldswithfalsediscoveryratecontrol AT malgorzatabogdan structurelearningofgaussianmarkovrandomfieldswithfalsediscoveryratecontrol
_version_	1724893451208622080

Structure Learning of Gaussian Markov Random Fields with False Discovery Rate Control

Similar Items