Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models

Bayesian Model Averaging (BMA) is a new area in modern applied statistics that provides data analysts with an efficient tool for discovering promising models and obtaining esti-mates of their posterior probabilities via Markov chain Monte Carlo (MCMC). These probabilities can be further used as weig...

Full description

Bibliographic Details
Main Author: Lipkovich, Ilya A.
Other Authors: Statistics
Format: Others
Published: Virginia Tech 2011
Subjects:
Online Access:http://hdl.handle.net/10919/11045
http://scholar.lib.vt.edu/theses/available/etd-04182002-020608
id ndltd-VTETD-oai-vtechworks.lib.vt.edu-10919-11045
record_format oai_dc
spelling ndltd-VTETD-oai-vtechworks.lib.vt.edu-10919-110452020-09-29T05:31:28Z Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models Lipkovich, Ilya A. Statistics Smith, Eric P. Foutz, Robert Birch, Jeffrey B. Terrell, George R. Ye, Keying Canonical Correspondence Analysis Outlier Analysis Bayesian Model Averaging Cluster Analysis Bayesian Model Averaging (BMA) is a new area in modern applied statistics that provides data analysts with an efficient tool for discovering promising models and obtaining esti-mates of their posterior probabilities via Markov chain Monte Carlo (MCMC). These probabilities can be further used as weights for model averaged predictions and estimates of the parameters of interest. As a result, variance components due to model selection are estimated and accounted for, contrary to the practice of conventional data analysis (such as, for example, stepwise model selection). In addition, variable activation probabilities can be obtained for each variable of interest. This dissertation is aimed at connecting BMA and various ramifications of the multivari-ate technique called Reduced-Rank Regression (RRR). In particular, we are concerned with Canonical Correspondence Analysis (CCA) in ecological applications where the data are represented by a site by species abundance matrix with site-specific covariates. Our goal is to incorporate the multivariate techniques, such as Redundancy Analysis and Ca-nonical Correspondence Analysis into the general machinery of BMA, taking into account such complicating phenomena as outliers and clustering of observations within a single data-analysis strategy. Traditional implementations of model averaging are concerned with selection of variables. We extend the methodology of BMA to selection of subgroups of observations and im-plement several approaches to cluster and outlier analysis in the context of the multivari-ate regression model. The proposed algorithm of cluster analysis can accommodate re-strictions on the resulting partition of observations when some of them form sub-clusters that have to be preserved when larger clusters are formed. Ph. D. 2011-08-22T18:35:37Z 2011-08-22T18:35:37Z 2002-04-09 2002-04-18 2003-04-22 2002-04-22 Dissertation etd-04182002-020608 http://hdl.handle.net/10919/11045 http://scholar.lib.vt.edu/theses/available/etd-04182002-020608 BMADisserFinal.pdf In Copyright http://rightsstatements.org/vocab/InC/1.0/ ETD application/pdf Virginia Tech
collection NDLTD
format Others
sources NDLTD
topic Canonical Correspondence Analysis
Outlier Analysis
Bayesian Model Averaging
Cluster Analysis
spellingShingle Canonical Correspondence Analysis
Outlier Analysis
Bayesian Model Averaging
Cluster Analysis
Lipkovich, Ilya A.
Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models
description Bayesian Model Averaging (BMA) is a new area in modern applied statistics that provides data analysts with an efficient tool for discovering promising models and obtaining esti-mates of their posterior probabilities via Markov chain Monte Carlo (MCMC). These probabilities can be further used as weights for model averaged predictions and estimates of the parameters of interest. As a result, variance components due to model selection are estimated and accounted for, contrary to the practice of conventional data analysis (such as, for example, stepwise model selection). In addition, variable activation probabilities can be obtained for each variable of interest. This dissertation is aimed at connecting BMA and various ramifications of the multivari-ate technique called Reduced-Rank Regression (RRR). In particular, we are concerned with Canonical Correspondence Analysis (CCA) in ecological applications where the data are represented by a site by species abundance matrix with site-specific covariates. Our goal is to incorporate the multivariate techniques, such as Redundancy Analysis and Ca-nonical Correspondence Analysis into the general machinery of BMA, taking into account such complicating phenomena as outliers and clustering of observations within a single data-analysis strategy. Traditional implementations of model averaging are concerned with selection of variables. We extend the methodology of BMA to selection of subgroups of observations and im-plement several approaches to cluster and outlier analysis in the context of the multivari-ate regression model. The proposed algorithm of cluster analysis can accommodate re-strictions on the resulting partition of observations when some of them form sub-clusters that have to be preserved when larger clusters are formed. === Ph. D.
author2 Statistics
author_facet Statistics
Lipkovich, Ilya A.
author Lipkovich, Ilya A.
author_sort Lipkovich, Ilya A.
title Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models
title_short Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models
title_full Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models
title_fullStr Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models
title_full_unstemmed Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models
title_sort bayesian model averaging and variable selection in multivariate ecological models
publisher Virginia Tech
publishDate 2011
url http://hdl.handle.net/10919/11045
http://scholar.lib.vt.edu/theses/available/etd-04182002-020608
work_keys_str_mv AT lipkovichilyaa bayesianmodelaveragingandvariableselectioninmultivariateecologicalmodels
_version_ 1719343267514941440