Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling
Climate projections simulated by Global Climate Models (GCMs) are often used for assessing the impacts of climate change. However, the relatively coarse resolutions of GCM outputs often preclude their application to accurately assessing the effects of climate change on finer regional-scale phenomena...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Copernicus Publications
2014-12-01
|
Series: | Nonlinear Processes in Geophysics |
Online Access: | http://www.nonlin-processes-geophys.net/21/1145/2014/npg-21-1145-2014.pdf |
id |
doaj-52a5df384fdb449783df70597fd9b7b0 |
---|---|
record_format |
Article |
spelling |
doaj-52a5df384fdb449783df70597fd9b7b02020-11-24T22:51:13ZengCopernicus PublicationsNonlinear Processes in Geophysics1023-58091607-79462014-12-012161145115710.5194/npg-21-1145-2014Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscalingD. Das0J. Dy1J. Ross2Z. Obradovic3A. R. Ganguly4Sustainability and Data Sciences Lab, Northeastern University, Boston, MA, USADepartment of Electrical and Computer Engineering, Northeastern University, Boston, MA, USADepartment of Electrical and Computer Engineering, Northeastern University, Boston, MA, USACenter for Data Analytics and Biomedical Informatics, Temple University, Philadelphia, PA, USASustainability and Data Sciences Lab, Northeastern University, Boston, MA, USAClimate projections simulated by Global Climate Models (GCMs) are often used for assessing the impacts of climate change. However, the relatively coarse resolutions of GCM outputs often preclude their application to accurately assessing the effects of climate change on finer regional-scale phenomena. Downscaling of climate variables from coarser to finer regional scales using statistical methods is often performed for regional climate projections. Statistical downscaling (SD) is based on the understanding that the regional climate is influenced by two factors – the large-scale climatic state and the regional or local features. A transfer function approach of SD involves learning a regression model that relates these features (predictors) to a climatic variable of interest (predictand) based on the past observations. However, often a single regression model is not sufficient to describe complex dynamic relationships between the predictors and predictand. We focus on the covariate selection part of the transfer function approach and propose a nonparametric Bayesian mixture of sparse regression models based on Dirichlet process (DP) for simultaneous clustering and discovery of covariates within the clusters while automatically finding the number of clusters. Sparse linear models are parsimonious and hence more generalizable than non-sparse alternatives, and lend themselves to domain relevant interpretation. Applications to synthetic data demonstrate the value of the new approach and preliminary results related to feature selection for statistical downscaling show that our method can lead to new insights.http://www.nonlin-processes-geophys.net/21/1145/2014/npg-21-1145-2014.pdf |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
D. Das J. Dy J. Ross Z. Obradovic A. R. Ganguly |
spellingShingle |
D. Das J. Dy J. Ross Z. Obradovic A. R. Ganguly Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling Nonlinear Processes in Geophysics |
author_facet |
D. Das J. Dy J. Ross Z. Obradovic A. R. Ganguly |
author_sort |
D. Das |
title |
Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling |
title_short |
Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling |
title_full |
Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling |
title_fullStr |
Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling |
title_full_unstemmed |
Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling |
title_sort |
non-parametric bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling |
publisher |
Copernicus Publications |
series |
Nonlinear Processes in Geophysics |
issn |
1023-5809 1607-7946 |
publishDate |
2014-12-01 |
description |
Climate projections simulated by Global Climate Models (GCMs) are often used
for assessing the impacts of climate change. However, the relatively coarse
resolutions of GCM outputs often preclude their application to
accurately assessing the effects of climate change on finer regional-scale
phenomena. Downscaling of climate variables from coarser to finer regional
scales using statistical methods is often performed for regional climate
projections. Statistical downscaling (SD) is based on the understanding that
the regional climate is influenced by two factors – the large-scale climatic
state and the regional or local features. A transfer function approach of SD
involves learning a regression model that relates these features
(predictors) to a climatic variable of interest (predictand) based on the
past observations. However, often a single regression model is not sufficient
to describe complex dynamic relationships between the predictors and
predictand. We focus on the covariate selection part of the transfer function
approach and propose a nonparametric Bayesian mixture of sparse regression
models based on Dirichlet process (DP) for simultaneous clustering and
discovery of covariates within the clusters while automatically finding the
number of clusters. Sparse linear models are parsimonious and hence
more generalizable than non-sparse alternatives, and lend themselves to
domain relevant interpretation. Applications to synthetic data demonstrate
the value of the new approach and preliminary results related to feature
selection for statistical downscaling show that our method can lead to new insights. |
url |
http://www.nonlin-processes-geophys.net/21/1145/2014/npg-21-1145-2014.pdf |
work_keys_str_mv |
AT ddas nonparametricbayesianmixtureofsparseregressionswithapplicationtowardsfeatureselectionforstatisticaldownscaling AT jdy nonparametricbayesianmixtureofsparseregressionswithapplicationtowardsfeatureselectionforstatisticaldownscaling AT jross nonparametricbayesianmixtureofsparseregressionswithapplicationtowardsfeatureselectionforstatisticaldownscaling AT zobradovic nonparametricbayesianmixtureofsparseregressionswithapplicationtowardsfeatureselectionforstatisticaldownscaling AT arganguly nonparametricbayesianmixtureofsparseregressionswithapplicationtowardsfeatureselectionforstatisticaldownscaling |
_version_ |
1725670857497378816 |