Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood

Bibliographic Details
Main Author: Pan, Juming
Language:English
Published: Bowling Green State University / OhioLINK 2016
Subjects:
Online Access:http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1466633921
id ndltd-OhioLink-oai-etd.ohiolink.edu-bgsu1466633921
record_format oai_dc
collection NDLTD
language English
sources NDLTD
topic Statistics
model selection
linear mixed models
oracle properties
spellingShingle Statistics
model selection
linear mixed models
oracle properties
Pan, Juming
Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood
author Pan, Juming
author_facet Pan, Juming
author_sort Pan, Juming
title Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood
title_short Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood
title_full Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood
title_fullStr Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood
title_full_unstemmed Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood
title_sort adaptive lasso for mixed model selection via profile log-likelihood
publisher Bowling Green State University / OhioLINK
publishDate 2016
url http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1466633921
work_keys_str_mv AT panjuming adaptivelassoformixedmodelselectionviaprofileloglikelihood
_version_ 1719440028940107776
spelling ndltd-OhioLink-oai-etd.ohiolink.edu-bgsu14666339212021-08-03T06:37:06Z Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood Pan, Juming Statistics model selection linear mixed models oracle properties Linear mixed models describe the relationship between a response variable and some predictors for data that are grouped according to one or more clustering factors. A linear mixed model consists of both fixed effects and random effects. Fixed effects are the conventional linear regression coefficients, and random effects are associated with units which are drawn randomly from a population. By accommodating such two types of parameters, linear mixed models provide an effective and flexible way of representing the means as well as the covariance structure of the data, therefore have been primarily used to model correlated data, and have received much attention in a variety of disciplines including agriculture, biology, medicine, and sociology.Due to the complex nature of the linear mixed models, the selection of only important covariates to create an interpretable model becomes challenging as the dimension of fixed or random effects increases. Thus, determining an appropriate structural form for a model to be used in making inferences and predictions is a fundamental problem in the analysis of longitudinal or clustered data using linear mixed models.This dissertation focuses on selection and estimation for linear mixed models by integratingthe recent advances in model selection. More specifically, we propose a two-stage penalized procedure for selecting and estimating important fixed and random effects. Compared with the traditional subset selection approaches, penalized methods can enhance the predictive power of a model, and can significantly reduce computational cost when the number of variables is large (Fan and Li, 2001). Our proposed procedure is different from the existing ones in theliterature mainly in two aspects. First, the proposed method is composed of twostages to separately choose the parameters of interests, therefore can respectand accommodate the distinct properties between the random and fixed effects. Second, the usage of the profile log-likelihoods in the selection process can make the computation more efficient and stable due to a smaller number of dimensions involved.In the first stage, we choose the random effects by maximizing the penalized restricted profile log-likelihood, and the maximization is completed by the Newton-Raphson algorithm. Observe that if a random effect is a noise variable, then the corresponding variance components should be all zero. Thus, we first estimate the covariance matrix of random effects using the adaptive LASSO penalized method and then identify the vital ones based on the estimated covariance matrix. In the view of such a selection procedure, the selectedrandom effects are invariant to the selection of the fixed effects. Whena proper model for the covariance is adopted, the correct covariancestructure will be obtained and valid inferences for the fixed effects can thenbe achieved in the next stage. We further study the theoretical propertiesof the proposed procedure for random effects selection. Weprove that, with probability tending to one, the proposedprocedure surely identifies all true random effects.After the completion of the random effects selection, in the secondstage, we select the fixed effects through the maximization of the penalized profile log-likelihood, which only involves the regression coefficients. The optimization of the penalized profile log-likelihood can be solved by the Newton-Raphson algorithm. We then investigate thesampling properties of the resulting estimate of fixed effects. We show that the resulting estimate enjoysmodel selection oracle properties, indicating that asymptotically the proposed approachcan discover the subset of significant predictors. After finishing the two-stage penalized procedure, the best linear mixed model cansubsequently be determined and be applied to handle correlated data in a number of fields.To illustrate the performance of the proposed method, numerous simulation studies have been conducted. The simulation results demonstrate that the proposed technique is quite efficient in selecting the best covariates and random covariance structure in linear mixed models and outperforms the existing selection methodologies in general. We finally apply the method to two real data applications for further examining its effectiveness in mixed model selection. 2016-07-18 English text Bowling Green State University / OhioLINK http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1466633921 http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1466633921 unrestricted This thesis or dissertation is protected by copyright: all rights reserved. It may not be copied or redistributed beyond the terms of applicable copyright laws.