%HPGLIMMIX: A High-Performance SAS Macro for GLMM Estimation

Generalized linear mixed models (GLMMs) comprise a class of widely used statistical tools for data analysis with fixed and random effects when the response variable has a conditional distribution in the exponential family. GLMM analysis also has a close relationship with actuarial credibility theory...

Full description

Bibliographic Details
Main Authors: Liang Xie, Laurence V. Madden
Format: Article
Language:English
Published: Foundation for Open Access Statistics 2014-06-01
Series:Journal of Statistical Software
Online Access:http://www.jstatsoft.org/index.php/jss/article/view/2151
id doaj-c606194697a148728d13830d43f419fb
record_format Article
spelling doaj-c606194697a148728d13830d43f419fb2020-11-24T22:18:15ZengFoundation for Open Access StatisticsJournal of Statistical Software1548-76602014-06-0158112510.18637/jss.v058.i08755%HPGLIMMIX: A High-Performance SAS Macro for GLMM EstimationLiang XieLaurence V. MaddenGeneralized linear mixed models (GLMMs) comprise a class of widely used statistical tools for data analysis with fixed and random effects when the response variable has a conditional distribution in the exponential family. GLMM analysis also has a close relationship with actuarial credibility theory. While readily available programs such as the GLIMMIX procedure in SAS and the lme4 package in R are powerful tools for using this class of models, these progarms are not able to handle models with thousands of levels of fixed and random effects. By using sparse-matrix and other high performance techniques, procedures such as HPMIXED in SAS can easily fit models with thousands of factor levels, but only for normally distributed response variables. In this paper, we present the %HPGLIMMIX SAS macro that fits GLMMs with large number of sparsely populated design matrices using the doubly-iterative linearization (pseudo-likelihood) method, in which the sparse-matrix-based HPMIXED is used for the inner iterations with the pseudo-variable constructed from the inverse-link function and the chosen model. Although the macro does not have the full functionality of the GLIMMIX procedure, time and memory savings can be large with the new macro. In applications in which design matrices contain many zeros and there are hundreds or thousands of factor levels, models can be fitted without exhausting computer memory, and 90% or better reduction in running time can be observed. Examples with a Poisson, binomial, and gamma conditional distribution are presented to demonstrate the usage and efficiency of this macro.http://www.jstatsoft.org/index.php/jss/article/view/2151
collection DOAJ
language English
format Article
sources DOAJ
author Liang Xie
Laurence V. Madden
spellingShingle Liang Xie
Laurence V. Madden
%HPGLIMMIX: A High-Performance SAS Macro for GLMM Estimation
Journal of Statistical Software
author_facet Liang Xie
Laurence V. Madden
author_sort Liang Xie
title %HPGLIMMIX: A High-Performance SAS Macro for GLMM Estimation
title_short %HPGLIMMIX: A High-Performance SAS Macro for GLMM Estimation
title_full %HPGLIMMIX: A High-Performance SAS Macro for GLMM Estimation
title_fullStr %HPGLIMMIX: A High-Performance SAS Macro for GLMM Estimation
title_full_unstemmed %HPGLIMMIX: A High-Performance SAS Macro for GLMM Estimation
title_sort %hpglimmix: a high-performance sas macro for glmm estimation
publisher Foundation for Open Access Statistics
series Journal of Statistical Software
issn 1548-7660
publishDate 2014-06-01
description Generalized linear mixed models (GLMMs) comprise a class of widely used statistical tools for data analysis with fixed and random effects when the response variable has a conditional distribution in the exponential family. GLMM analysis also has a close relationship with actuarial credibility theory. While readily available programs such as the GLIMMIX procedure in SAS and the lme4 package in R are powerful tools for using this class of models, these progarms are not able to handle models with thousands of levels of fixed and random effects. By using sparse-matrix and other high performance techniques, procedures such as HPMIXED in SAS can easily fit models with thousands of factor levels, but only for normally distributed response variables. In this paper, we present the %HPGLIMMIX SAS macro that fits GLMMs with large number of sparsely populated design matrices using the doubly-iterative linearization (pseudo-likelihood) method, in which the sparse-matrix-based HPMIXED is used for the inner iterations with the pseudo-variable constructed from the inverse-link function and the chosen model. Although the macro does not have the full functionality of the GLIMMIX procedure, time and memory savings can be large with the new macro. In applications in which design matrices contain many zeros and there are hundreds or thousands of factor levels, models can be fitted without exhausting computer memory, and 90% or better reduction in running time can be observed. Examples with a Poisson, binomial, and gamma conditional distribution are presented to demonstrate the usage and efficiency of this macro.
url http://www.jstatsoft.org/index.php/jss/article/view/2151
work_keys_str_mv AT liangxie hpglimmixahighperformancesasmacroforglmmestimation
AT laurencevmadden hpglimmixahighperformancesasmacroforglmmestimation
_version_ 1725782666661330944