Bayesian network structure learning for the uncertain experimentalist : with applications to network biology
In this work, we address both the computational and modeling aspects of Bayesian network structure learning. Several recent algorithms can handle large networks by operating on the space of variable orderings, but for technical reasons they cannot compute many interesting structural features and...
Main Author: | |
---|---|
Language: | English |
Published: |
University of British Columbia
2011
|
Online Access: | http://hdl.handle.net/2429/31610 |
id |
ndltd-UBC-oai-circle.library.ubc.ca-2429-31610 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UBC-oai-circle.library.ubc.ca-2429-316102018-01-05T17:46:13Z Bayesian network structure learning for the uncertain experimentalist : with applications to network biology Eaton, Daniel James In this work, we address both the computational and modeling aspects of Bayesian network structure learning. Several recent algorithms can handle large networks by operating on the space of variable orderings, but for technical reasons they cannot compute many interesting structural features and require the use of a restrictive prior. We introduce a novel MCMC method that utilizes the deterministic output of the exact structure learning algorithm of Koivisto and Sood to construct a fast-mixing proposal on the space of DAGs. We show that in addition to fixing the order-space algorithms' shortcomings, our method outperforms other existing samplers on real datasets by delivering more accurate structure and higher predictive likelihoods in less compute time. Next, we discuss current models of intervention and propose a novel approach named the uncertain intervention model, whereby the targets of an intervention can be learned in parallel to the graph's causal structure. We validate our model experimentally using synthetic data generated from known ground truth. We then apply our model to two biological datasets that have been previously analyzed using Bayesian networks. On the T-cell dataset of Sachs et al. we show that the uncertain intervention model is able to better model the density of the data compared to previous techniques, while on the ALL dataset of Yeoh et al. we demonstrate that our method can be used to directly estimate the genetic effects of the disease. Science, Faculty of Computer Science, Department of Graduate 2011-02-22T21:25:35Z 2011-02-22T21:25:35Z 2007 Text Thesis/Dissertation http://hdl.handle.net/2429/31610 eng For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use. University of British Columbia |
collection |
NDLTD |
language |
English |
sources |
NDLTD |
description |
In this work, we address both the computational and modeling aspects of
Bayesian network structure learning. Several recent algorithms can handle
large networks by operating on the space of variable orderings, but for technical
reasons they cannot compute many interesting structural features and
require the use of a restrictive prior. We introduce a novel MCMC method that
utilizes the deterministic output of the exact structure learning algorithm of
Koivisto and Sood to construct a fast-mixing proposal on the space of DAGs.
We show that in addition to fixing the order-space algorithms' shortcomings,
our method outperforms other existing samplers on real datasets by delivering
more accurate structure and higher predictive likelihoods in less compute time.
Next, we discuss current models of intervention and propose a novel approach
named the uncertain intervention model, whereby the targets of an intervention
can be learned in parallel to the graph's causal structure. We validate our
model experimentally using synthetic data generated from known ground truth.
We then apply our model to two biological datasets that have been previously
analyzed using Bayesian networks. On the T-cell dataset of Sachs et al. we
show that the uncertain intervention model is able to better model the density
of the data compared to previous techniques, while on the ALL dataset of Yeoh
et al. we demonstrate that our method can be used to directly estimate the
genetic effects of the disease. === Science, Faculty of === Computer Science, Department of === Graduate |
author |
Eaton, Daniel James |
spellingShingle |
Eaton, Daniel James Bayesian network structure learning for the uncertain experimentalist : with applications to network biology |
author_facet |
Eaton, Daniel James |
author_sort |
Eaton, Daniel James |
title |
Bayesian network structure learning for the uncertain experimentalist : with applications to network biology |
title_short |
Bayesian network structure learning for the uncertain experimentalist : with applications to network biology |
title_full |
Bayesian network structure learning for the uncertain experimentalist : with applications to network biology |
title_fullStr |
Bayesian network structure learning for the uncertain experimentalist : with applications to network biology |
title_full_unstemmed |
Bayesian network structure learning for the uncertain experimentalist : with applications to network biology |
title_sort |
bayesian network structure learning for the uncertain experimentalist : with applications to network biology |
publisher |
University of British Columbia |
publishDate |
2011 |
url |
http://hdl.handle.net/2429/31610 |
work_keys_str_mv |
AT eatondanieljames bayesiannetworkstructurelearningfortheuncertainexperimentalistwithapplicationstonetworkbiology |
_version_ |
1718594491745566720 |