Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling

Climate projections simulated by Global Climate Models (GCMs) are often used for assessing the impacts of climate change. However, the relatively coarse resolutions of GCM outputs often preclude their application to accurately assessing the effects of climate change on finer regional-scale phenomena...

Full description

Bibliographic Details
Main Authors: D. Das, J. Dy, J. Ross, Z. Obradovic, A. R. Ganguly
Format: Article
Language:English
Published: Copernicus Publications 2014-12-01
Series:Nonlinear Processes in Geophysics
Online Access:http://www.nonlin-processes-geophys.net/21/1145/2014/npg-21-1145-2014.pdf
id doaj-52a5df384fdb449783df70597fd9b7b0
record_format Article
spelling doaj-52a5df384fdb449783df70597fd9b7b02020-11-24T22:51:13ZengCopernicus PublicationsNonlinear Processes in Geophysics1023-58091607-79462014-12-012161145115710.5194/npg-21-1145-2014Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscalingD. Das0J. Dy1J. Ross2Z. Obradovic3A. R. Ganguly4Sustainability and Data Sciences Lab, Northeastern University, Boston, MA, USADepartment of Electrical and Computer Engineering, Northeastern University, Boston, MA, USADepartment of Electrical and Computer Engineering, Northeastern University, Boston, MA, USACenter for Data Analytics and Biomedical Informatics, Temple University, Philadelphia, PA, USASustainability and Data Sciences Lab, Northeastern University, Boston, MA, USAClimate projections simulated by Global Climate Models (GCMs) are often used for assessing the impacts of climate change. However, the relatively coarse resolutions of GCM outputs often preclude their application to accurately assessing the effects of climate change on finer regional-scale phenomena. Downscaling of climate variables from coarser to finer regional scales using statistical methods is often performed for regional climate projections. Statistical downscaling (SD) is based on the understanding that the regional climate is influenced by two factors – the large-scale climatic state and the regional or local features. A transfer function approach of SD involves learning a regression model that relates these features (predictors) to a climatic variable of interest (predictand) based on the past observations. However, often a single regression model is not sufficient to describe complex dynamic relationships between the predictors and predictand. We focus on the covariate selection part of the transfer function approach and propose a nonparametric Bayesian mixture of sparse regression models based on Dirichlet process (DP) for simultaneous clustering and discovery of covariates within the clusters while automatically finding the number of clusters. Sparse linear models are parsimonious and hence more generalizable than non-sparse alternatives, and lend themselves to domain relevant interpretation. Applications to synthetic data demonstrate the value of the new approach and preliminary results related to feature selection for statistical downscaling show that our method can lead to new insights.http://www.nonlin-processes-geophys.net/21/1145/2014/npg-21-1145-2014.pdf
collection DOAJ
language English
format Article
sources DOAJ
author D. Das
J. Dy
J. Ross
Z. Obradovic
A. R. Ganguly
spellingShingle D. Das
J. Dy
J. Ross
Z. Obradovic
A. R. Ganguly
Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling
Nonlinear Processes in Geophysics
author_facet D. Das
J. Dy
J. Ross
Z. Obradovic
A. R. Ganguly
author_sort D. Das
title Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling
title_short Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling
title_full Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling
title_fullStr Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling
title_full_unstemmed Non-parametric Bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling
title_sort non-parametric bayesian mixture of sparse regressions with application towards feature selection for statistical downscaling
publisher Copernicus Publications
series Nonlinear Processes in Geophysics
issn 1023-5809
1607-7946
publishDate 2014-12-01
description Climate projections simulated by Global Climate Models (GCMs) are often used for assessing the impacts of climate change. However, the relatively coarse resolutions of GCM outputs often preclude their application to accurately assessing the effects of climate change on finer regional-scale phenomena. Downscaling of climate variables from coarser to finer regional scales using statistical methods is often performed for regional climate projections. Statistical downscaling (SD) is based on the understanding that the regional climate is influenced by two factors – the large-scale climatic state and the regional or local features. A transfer function approach of SD involves learning a regression model that relates these features (predictors) to a climatic variable of interest (predictand) based on the past observations. However, often a single regression model is not sufficient to describe complex dynamic relationships between the predictors and predictand. We focus on the covariate selection part of the transfer function approach and propose a nonparametric Bayesian mixture of sparse regression models based on Dirichlet process (DP) for simultaneous clustering and discovery of covariates within the clusters while automatically finding the number of clusters. Sparse linear models are parsimonious and hence more generalizable than non-sparse alternatives, and lend themselves to domain relevant interpretation. Applications to synthetic data demonstrate the value of the new approach and preliminary results related to feature selection for statistical downscaling show that our method can lead to new insights.
url http://www.nonlin-processes-geophys.net/21/1145/2014/npg-21-1145-2014.pdf
work_keys_str_mv AT ddas nonparametricbayesianmixtureofsparseregressionswithapplicationtowardsfeatureselectionforstatisticaldownscaling
AT jdy nonparametricbayesianmixtureofsparseregressionswithapplicationtowardsfeatureselectionforstatisticaldownscaling
AT jross nonparametricbayesianmixtureofsparseregressionswithapplicationtowardsfeatureselectionforstatisticaldownscaling
AT zobradovic nonparametricbayesianmixtureofsparseregressionswithapplicationtowardsfeatureselectionforstatisticaldownscaling
AT arganguly nonparametricbayesianmixtureofsparseregressionswithapplicationtowardsfeatureselectionforstatisticaldownscaling
_version_ 1725670857497378816