%HPGLIMMIX: A High-Performance SAS Macro for GLMM Estimation
Generalized linear mixed models (GLMMs) comprise a class of widely used statistical tools for data analysis with fixed and random effects when the response variable has a conditional distribution in the exponential family. GLMM analysis also has a close relationship with actuarial credibility theory...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Foundation for Open Access Statistics
2014-06-01
|
Series: | Journal of Statistical Software |
Online Access: | http://www.jstatsoft.org/index.php/jss/article/view/2151 |
id |
doaj-c606194697a148728d13830d43f419fb |
---|---|
record_format |
Article |
spelling |
doaj-c606194697a148728d13830d43f419fb2020-11-24T22:18:15ZengFoundation for Open Access StatisticsJournal of Statistical Software1548-76602014-06-0158112510.18637/jss.v058.i08755%HPGLIMMIX: A High-Performance SAS Macro for GLMM EstimationLiang XieLaurence V. MaddenGeneralized linear mixed models (GLMMs) comprise a class of widely used statistical tools for data analysis with fixed and random effects when the response variable has a conditional distribution in the exponential family. GLMM analysis also has a close relationship with actuarial credibility theory. While readily available programs such as the GLIMMIX procedure in SAS and the lme4 package in R are powerful tools for using this class of models, these progarms are not able to handle models with thousands of levels of fixed and random effects. By using sparse-matrix and other high performance techniques, procedures such as HPMIXED in SAS can easily fit models with thousands of factor levels, but only for normally distributed response variables. In this paper, we present the %HPGLIMMIX SAS macro that fits GLMMs with large number of sparsely populated design matrices using the doubly-iterative linearization (pseudo-likelihood) method, in which the sparse-matrix-based HPMIXED is used for the inner iterations with the pseudo-variable constructed from the inverse-link function and the chosen model. Although the macro does not have the full functionality of the GLIMMIX procedure, time and memory savings can be large with the new macro. In applications in which design matrices contain many zeros and there are hundreds or thousands of factor levels, models can be fitted without exhausting computer memory, and 90% or better reduction in running time can be observed. Examples with a Poisson, binomial, and gamma conditional distribution are presented to demonstrate the usage and efficiency of this macro.http://www.jstatsoft.org/index.php/jss/article/view/2151 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Liang Xie Laurence V. Madden |
spellingShingle |
Liang Xie Laurence V. Madden %HPGLIMMIX: A High-Performance SAS Macro for GLMM Estimation Journal of Statistical Software |
author_facet |
Liang Xie Laurence V. Madden |
author_sort |
Liang Xie |
title |
%HPGLIMMIX: A High-Performance SAS Macro for GLMM Estimation |
title_short |
%HPGLIMMIX: A High-Performance SAS Macro for GLMM Estimation |
title_full |
%HPGLIMMIX: A High-Performance SAS Macro for GLMM Estimation |
title_fullStr |
%HPGLIMMIX: A High-Performance SAS Macro for GLMM Estimation |
title_full_unstemmed |
%HPGLIMMIX: A High-Performance SAS Macro for GLMM Estimation |
title_sort |
%hpglimmix: a high-performance sas macro for glmm estimation |
publisher |
Foundation for Open Access Statistics |
series |
Journal of Statistical Software |
issn |
1548-7660 |
publishDate |
2014-06-01 |
description |
Generalized linear mixed models (GLMMs) comprise a class of widely used statistical tools for data analysis with fixed and random effects when the response variable has a conditional distribution in the exponential family. GLMM analysis also has a close relationship with actuarial credibility theory. While readily available programs such as the GLIMMIX procedure in SAS and the lme4 package in R are powerful tools for using this class of models, these progarms are not able to handle models with thousands of levels of fixed and random effects. By using sparse-matrix and other high performance techniques, procedures such as HPMIXED in SAS can easily fit models with thousands of factor levels, but only for normally distributed response variables. In this paper, we present the %HPGLIMMIX SAS macro that fits GLMMs with large number of sparsely populated design matrices using the doubly-iterative linearization (pseudo-likelihood) method, in which the sparse-matrix-based HPMIXED is used for the inner iterations with the pseudo-variable constructed from the inverse-link function and the chosen model. Although the macro does not have the full functionality of the GLIMMIX procedure, time and memory savings can be large with the new macro. In applications in which design matrices contain many zeros and there are hundreds or thousands of factor levels, models can be fitted without exhausting computer memory, and 90% or better reduction in running time can be observed. Examples with a Poisson, binomial, and gamma conditional distribution are presented to demonstrate the usage and efficiency of this macro. |
url |
http://www.jstatsoft.org/index.php/jss/article/view/2151 |
work_keys_str_mv |
AT liangxie hpglimmixahighperformancesasmacroforglmmestimation AT laurencevmadden hpglimmixahighperformancesasmacroforglmmestimation |
_version_ |
1725782666661330944 |