Bayesian network structure learning for the uncertain experimentalist : with applications to network biology

In this work, we address both the computational and modeling aspects of Bayesian network structure learning. Several recent algorithms can handle large networks by operating on the space of variable orderings, but for technical reasons they cannot compute many interesting structural features and...

Full description

Bibliographic Details
Main Author: Eaton, Daniel James
Language:English
Published: University of British Columbia 2011
Online Access:http://hdl.handle.net/2429/31610
id ndltd-UBC-oai-circle.library.ubc.ca-2429-31610
record_format oai_dc
spelling ndltd-UBC-oai-circle.library.ubc.ca-2429-316102018-01-05T17:46:13Z Bayesian network structure learning for the uncertain experimentalist : with applications to network biology Eaton, Daniel James In this work, we address both the computational and modeling aspects of Bayesian network structure learning. Several recent algorithms can handle large networks by operating on the space of variable orderings, but for technical reasons they cannot compute many interesting structural features and require the use of a restrictive prior. We introduce a novel MCMC method that utilizes the deterministic output of the exact structure learning algorithm of Koivisto and Sood to construct a fast-mixing proposal on the space of DAGs. We show that in addition to fixing the order-space algorithms' shortcomings, our method outperforms other existing samplers on real datasets by delivering more accurate structure and higher predictive likelihoods in less compute time. Next, we discuss current models of intervention and propose a novel approach named the uncertain intervention model, whereby the targets of an intervention can be learned in parallel to the graph's causal structure. We validate our model experimentally using synthetic data generated from known ground truth. We then apply our model to two biological datasets that have been previously analyzed using Bayesian networks. On the T-cell dataset of Sachs et al. we show that the uncertain intervention model is able to better model the density of the data compared to previous techniques, while on the ALL dataset of Yeoh et al. we demonstrate that our method can be used to directly estimate the genetic effects of the disease. Science, Faculty of Computer Science, Department of Graduate 2011-02-22T21:25:35Z 2011-02-22T21:25:35Z 2007 Text Thesis/Dissertation http://hdl.handle.net/2429/31610 eng For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use. University of British Columbia
collection NDLTD
language English
sources NDLTD
description In this work, we address both the computational and modeling aspects of Bayesian network structure learning. Several recent algorithms can handle large networks by operating on the space of variable orderings, but for technical reasons they cannot compute many interesting structural features and require the use of a restrictive prior. We introduce a novel MCMC method that utilizes the deterministic output of the exact structure learning algorithm of Koivisto and Sood to construct a fast-mixing proposal on the space of DAGs. We show that in addition to fixing the order-space algorithms' shortcomings, our method outperforms other existing samplers on real datasets by delivering more accurate structure and higher predictive likelihoods in less compute time. Next, we discuss current models of intervention and propose a novel approach named the uncertain intervention model, whereby the targets of an intervention can be learned in parallel to the graph's causal structure. We validate our model experimentally using synthetic data generated from known ground truth. We then apply our model to two biological datasets that have been previously analyzed using Bayesian networks. On the T-cell dataset of Sachs et al. we show that the uncertain intervention model is able to better model the density of the data compared to previous techniques, while on the ALL dataset of Yeoh et al. we demonstrate that our method can be used to directly estimate the genetic effects of the disease. === Science, Faculty of === Computer Science, Department of === Graduate
author Eaton, Daniel James
spellingShingle Eaton, Daniel James
Bayesian network structure learning for the uncertain experimentalist : with applications to network biology
author_facet Eaton, Daniel James
author_sort Eaton, Daniel James
title Bayesian network structure learning for the uncertain experimentalist : with applications to network biology
title_short Bayesian network structure learning for the uncertain experimentalist : with applications to network biology
title_full Bayesian network structure learning for the uncertain experimentalist : with applications to network biology
title_fullStr Bayesian network structure learning for the uncertain experimentalist : with applications to network biology
title_full_unstemmed Bayesian network structure learning for the uncertain experimentalist : with applications to network biology
title_sort bayesian network structure learning for the uncertain experimentalist : with applications to network biology
publisher University of British Columbia
publishDate 2011
url http://hdl.handle.net/2429/31610
work_keys_str_mv AT eatondanieljames bayesiannetworkstructurelearningfortheuncertainexperimentalistwithapplicationstonetworkbiology
_version_ 1718594491745566720