Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models
Bayesian Model Averaging (BMA) is a new area in modern applied statistics that provides data analysts with an efficient tool for discovering promising models and obtaining esti-mates of their posterior probabilities via Markov chain Monte Carlo (MCMC). These probabilities can be further used as weig...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Published: |
Virginia Tech
2011
|
Subjects: | |
Online Access: | http://hdl.handle.net/10919/11045 http://scholar.lib.vt.edu/theses/available/etd-04182002-020608 |
id |
ndltd-VTETD-oai-vtechworks.lib.vt.edu-10919-11045 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-VTETD-oai-vtechworks.lib.vt.edu-10919-110452020-09-29T05:31:28Z Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models Lipkovich, Ilya A. Statistics Smith, Eric P. Foutz, Robert Birch, Jeffrey B. Terrell, George R. Ye, Keying Canonical Correspondence Analysis Outlier Analysis Bayesian Model Averaging Cluster Analysis Bayesian Model Averaging (BMA) is a new area in modern applied statistics that provides data analysts with an efficient tool for discovering promising models and obtaining esti-mates of their posterior probabilities via Markov chain Monte Carlo (MCMC). These probabilities can be further used as weights for model averaged predictions and estimates of the parameters of interest. As a result, variance components due to model selection are estimated and accounted for, contrary to the practice of conventional data analysis (such as, for example, stepwise model selection). In addition, variable activation probabilities can be obtained for each variable of interest. This dissertation is aimed at connecting BMA and various ramifications of the multivari-ate technique called Reduced-Rank Regression (RRR). In particular, we are concerned with Canonical Correspondence Analysis (CCA) in ecological applications where the data are represented by a site by species abundance matrix with site-specific covariates. Our goal is to incorporate the multivariate techniques, such as Redundancy Analysis and Ca-nonical Correspondence Analysis into the general machinery of BMA, taking into account such complicating phenomena as outliers and clustering of observations within a single data-analysis strategy. Traditional implementations of model averaging are concerned with selection of variables. We extend the methodology of BMA to selection of subgroups of observations and im-plement several approaches to cluster and outlier analysis in the context of the multivari-ate regression model. The proposed algorithm of cluster analysis can accommodate re-strictions on the resulting partition of observations when some of them form sub-clusters that have to be preserved when larger clusters are formed. Ph. D. 2011-08-22T18:35:37Z 2011-08-22T18:35:37Z 2002-04-09 2002-04-18 2003-04-22 2002-04-22 Dissertation etd-04182002-020608 http://hdl.handle.net/10919/11045 http://scholar.lib.vt.edu/theses/available/etd-04182002-020608 BMADisserFinal.pdf In Copyright http://rightsstatements.org/vocab/InC/1.0/ ETD application/pdf Virginia Tech |
collection |
NDLTD |
format |
Others
|
sources |
NDLTD |
topic |
Canonical Correspondence Analysis Outlier Analysis Bayesian Model Averaging Cluster Analysis |
spellingShingle |
Canonical Correspondence Analysis Outlier Analysis Bayesian Model Averaging Cluster Analysis Lipkovich, Ilya A. Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models |
description |
Bayesian Model Averaging (BMA) is a new area in modern applied statistics that provides data analysts with an efficient tool for discovering promising models and obtaining esti-mates of their posterior probabilities via Markov chain Monte Carlo (MCMC). These probabilities can be further used as weights for model averaged predictions and estimates of the parameters of interest. As a result, variance components due to model selection are estimated and accounted for, contrary to the practice of conventional data analysis (such as, for example, stepwise model selection). In addition, variable activation probabilities can be obtained for each variable of interest. This dissertation is aimed at connecting BMA and various ramifications of the multivari-ate technique called Reduced-Rank Regression (RRR). In particular, we are concerned with Canonical Correspondence Analysis (CCA) in ecological applications where the data are represented by a site by species abundance matrix with site-specific covariates. Our goal is to incorporate the multivariate techniques, such as Redundancy Analysis and Ca-nonical Correspondence Analysis into the general machinery of BMA, taking into account such complicating phenomena as outliers and clustering of observations within a single data-analysis strategy. Traditional implementations of model averaging are concerned with selection of variables. We extend the methodology of BMA to selection of subgroups of observations and im-plement several approaches to cluster and outlier analysis in the context of the multivari-ate regression model. The proposed algorithm of cluster analysis can accommodate re-strictions on the resulting partition of observations when some of them form sub-clusters that have to be preserved when larger clusters are formed. === Ph. D. |
author2 |
Statistics |
author_facet |
Statistics Lipkovich, Ilya A. |
author |
Lipkovich, Ilya A. |
author_sort |
Lipkovich, Ilya A. |
title |
Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models |
title_short |
Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models |
title_full |
Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models |
title_fullStr |
Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models |
title_full_unstemmed |
Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models |
title_sort |
bayesian model averaging and variable selection in multivariate ecological models |
publisher |
Virginia Tech |
publishDate |
2011 |
url |
http://hdl.handle.net/10919/11045 http://scholar.lib.vt.edu/theses/available/etd-04182002-020608 |
work_keys_str_mv |
AT lipkovichilyaa bayesianmodelaveragingandvariableselectioninmultivariateecologicalmodels |
_version_ |
1719343267514941440 |