Polytopes Arising from Binary Multi-way Contingency Tables and Characteristic Imsets for Bayesian Networks

The main theme of this dissertation is the study of polytopes arising from binary multi-way contingency tables and characteristic imsets for Bayesian networks. Firstly, we study on three-way tables whose entries are independent Bernoulli ran- dom variables with canonical parameters under no three-wa...

Full description

Bibliographic Details
Main Author: Xi, Jing
Format: Others
Published: UKnowledge 2013
Subjects:
Online Access:http://uknowledge.uky.edu/statistics_etds/5
http://uknowledge.uky.edu/cgi/viewcontent.cgi?article=1008&context=statistics_etds
id ndltd-uky.edu-oai-uknowledge.uky.edu-statistics_etds-1008
record_format oai_dc
spelling ndltd-uky.edu-oai-uknowledge.uky.edu-statistics_etds-10082015-04-11T05:03:34Z Polytopes Arising from Binary Multi-way Contingency Tables and Characteristic Imsets for Bayesian Networks Xi, Jing The main theme of this dissertation is the study of polytopes arising from binary multi-way contingency tables and characteristic imsets for Bayesian networks. Firstly, we study on three-way tables whose entries are independent Bernoulli ran- dom variables with canonical parameters under no three-way interaction generalized linear models. Here, we use the sequential importance sampling (SIS) method with the conditional Poisson (CP) distribution to sample binary three-way tables with the sufficient statistics, i.e., all two-way marginal sums, fixed. Compared with Monte Carlo Markov Chain (MCMC) approach with a Markov basis (MB), SIS procedure has the advantage that it does not require expensive or prohibitive pre-computations. Note that this problem can also be considered as estimating the number of lattice points inside the polytope defined by the zero-one and two-way marginal constraints. The theorems in Chapter 2 give the parameters for the CP distribution on each column when it is sampled. In this chapter, we also present the algorithms, the simulation results, and the results for Samson’s monks data. Bayesian networks, a part of the family of probabilistic graphical models, are widely applied in many areas and much work has been done in model selections for Bayesian networks. The second part of this dissertation investigates the problem of finding the optimal graph by using characteristic imsets, where characteristic imsets are defined as 0-1 vector representations of Bayesian networks which are unique up to Markov equivalence. Characteristic imset polytopes are defined as the convex hull of all characteristic imsets we consider. It was proven that the problem of finding optimal Bayesian network for a specific dataset can be converted to a linear programming problem over the characteristic imset polytope [51]. In Chapter 3, we first consider characteristic imset polytopes for all diagnosis models and show that these polytopes are direct product of simplices. Then we give the combinatorial description of all edges and all facets of these polytopes. At the end of this chapter, we generalize these results to the characteristic imset polytopes for all Bayesian networks with a fixed underlying ordering of nodes. Chapter 4 includes discussion and future work on these two topics. 2013-01-01T08:00:00Z text application/pdf http://uknowledge.uky.edu/statistics_etds/5 http://uknowledge.uky.edu/cgi/viewcontent.cgi?article=1008&context=statistics_etds Theses and Dissertations--Statistics UKnowledge Sequential importance sampling Conditional Poisson Counting prob- lem Learning Bayesian networks Characteristic imset polytope Statistics and Probability
collection NDLTD
format Others
sources NDLTD
topic Sequential importance sampling
Conditional Poisson
Counting prob- lem
Learning Bayesian networks
Characteristic imset polytope
Statistics and Probability
spellingShingle Sequential importance sampling
Conditional Poisson
Counting prob- lem
Learning Bayesian networks
Characteristic imset polytope
Statistics and Probability
Xi, Jing
Polytopes Arising from Binary Multi-way Contingency Tables and Characteristic Imsets for Bayesian Networks
description The main theme of this dissertation is the study of polytopes arising from binary multi-way contingency tables and characteristic imsets for Bayesian networks. Firstly, we study on three-way tables whose entries are independent Bernoulli ran- dom variables with canonical parameters under no three-way interaction generalized linear models. Here, we use the sequential importance sampling (SIS) method with the conditional Poisson (CP) distribution to sample binary three-way tables with the sufficient statistics, i.e., all two-way marginal sums, fixed. Compared with Monte Carlo Markov Chain (MCMC) approach with a Markov basis (MB), SIS procedure has the advantage that it does not require expensive or prohibitive pre-computations. Note that this problem can also be considered as estimating the number of lattice points inside the polytope defined by the zero-one and two-way marginal constraints. The theorems in Chapter 2 give the parameters for the CP distribution on each column when it is sampled. In this chapter, we also present the algorithms, the simulation results, and the results for Samson’s monks data. Bayesian networks, a part of the family of probabilistic graphical models, are widely applied in many areas and much work has been done in model selections for Bayesian networks. The second part of this dissertation investigates the problem of finding the optimal graph by using characteristic imsets, where characteristic imsets are defined as 0-1 vector representations of Bayesian networks which are unique up to Markov equivalence. Characteristic imset polytopes are defined as the convex hull of all characteristic imsets we consider. It was proven that the problem of finding optimal Bayesian network for a specific dataset can be converted to a linear programming problem over the characteristic imset polytope [51]. In Chapter 3, we first consider characteristic imset polytopes for all diagnosis models and show that these polytopes are direct product of simplices. Then we give the combinatorial description of all edges and all facets of these polytopes. At the end of this chapter, we generalize these results to the characteristic imset polytopes for all Bayesian networks with a fixed underlying ordering of nodes. Chapter 4 includes discussion and future work on these two topics.
author Xi, Jing
author_facet Xi, Jing
author_sort Xi, Jing
title Polytopes Arising from Binary Multi-way Contingency Tables and Characteristic Imsets for Bayesian Networks
title_short Polytopes Arising from Binary Multi-way Contingency Tables and Characteristic Imsets for Bayesian Networks
title_full Polytopes Arising from Binary Multi-way Contingency Tables and Characteristic Imsets for Bayesian Networks
title_fullStr Polytopes Arising from Binary Multi-way Contingency Tables and Characteristic Imsets for Bayesian Networks
title_full_unstemmed Polytopes Arising from Binary Multi-way Contingency Tables and Characteristic Imsets for Bayesian Networks
title_sort polytopes arising from binary multi-way contingency tables and characteristic imsets for bayesian networks
publisher UKnowledge
publishDate 2013
url http://uknowledge.uky.edu/statistics_etds/5
http://uknowledge.uky.edu/cgi/viewcontent.cgi?article=1008&context=statistics_etds
work_keys_str_mv AT xijing polytopesarisingfrombinarymultiwaycontingencytablesandcharacteristicimsetsforbayesiannetworks
_version_ 1716801035253579776