Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood
Main Author: | |
---|---|
Language: | English |
Published: |
Bowling Green State University / OhioLINK
2016
|
Subjects: | |
Online Access: | http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1466633921 |
id |
ndltd-OhioLink-oai-etd.ohiolink.edu-bgsu1466633921 |
---|---|
record_format |
oai_dc |
collection |
NDLTD |
language |
English |
sources |
NDLTD |
topic |
Statistics model selection linear mixed models oracle properties |
spellingShingle |
Statistics model selection linear mixed models oracle properties Pan, Juming Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood |
author |
Pan, Juming |
author_facet |
Pan, Juming |
author_sort |
Pan, Juming |
title |
Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood |
title_short |
Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood |
title_full |
Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood |
title_fullStr |
Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood |
title_full_unstemmed |
Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood |
title_sort |
adaptive lasso for mixed model selection via profile log-likelihood |
publisher |
Bowling Green State University / OhioLINK |
publishDate |
2016 |
url |
http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1466633921 |
work_keys_str_mv |
AT panjuming adaptivelassoformixedmodelselectionviaprofileloglikelihood |
_version_ |
1719440028940107776 |
spelling |
ndltd-OhioLink-oai-etd.ohiolink.edu-bgsu14666339212021-08-03T06:37:06Z Adaptive LASSO For Mixed Model Selection via Profile Log-Likelihood Pan, Juming Statistics model selection linear mixed models oracle properties Linear mixed models describe the relationship between a response variable and some predictors for data that are grouped according to one or more clustering factors. A linear mixed model consists of both fixed effects and random effects. Fixed effects are the conventional linear regression coefficients, and random effects are associated with units which are drawn randomly from a population. By accommodating such two types of parameters, linear mixed models provide an effective and flexible way of representing the means as well as the covariance structure of the data, therefore have been primarily used to model correlated data, and have received much attention in a variety of disciplines including agriculture, biology, medicine, and sociology.Due to the complex nature of the linear mixed models, the selection of only important covariates to create an interpretable model becomes challenging as the dimension of fixed or random effects increases. Thus, determining an appropriate structural form for a model to be used in making inferences and predictions is a fundamental problem in the analysis of longitudinal or clustered data using linear mixed models.This dissertation focuses on selection and estimation for linear mixed models by integratingthe recent advances in model selection. More specifically, we propose a two-stage penalized procedure for selecting and estimating important fixed and random effects. Compared with the traditional subset selection approaches, penalized methods can enhance the predictive power of a model, and can significantly reduce computational cost when the number of variables is large (Fan and Li, 2001). Our proposed procedure is different from the existing ones in theliterature mainly in two aspects. First, the proposed method is composed of twostages to separately choose the parameters of interests, therefore can respectand accommodate the distinct properties between the random and fixed effects. Second, the usage of the profile log-likelihoods in the selection process can make the computation more efficient and stable due to a smaller number of dimensions involved.In the first stage, we choose the random effects by maximizing the penalized restricted profile log-likelihood, and the maximization is completed by the Newton-Raphson algorithm. Observe that if a random effect is a noise variable, then the corresponding variance components should be all zero. Thus, we first estimate the covariance matrix of random effects using the adaptive LASSO penalized method and then identify the vital ones based on the estimated covariance matrix. In the view of such a selection procedure, the selectedrandom effects are invariant to the selection of the fixed effects. Whena proper model for the covariance is adopted, the correct covariancestructure will be obtained and valid inferences for the fixed effects can thenbe achieved in the next stage. We further study the theoretical propertiesof the proposed procedure for random effects selection. Weprove that, with probability tending to one, the proposedprocedure surely identifies all true random effects.After the completion of the random effects selection, in the secondstage, we select the fixed effects through the maximization of the penalized profile log-likelihood, which only involves the regression coefficients. The optimization of the penalized profile log-likelihood can be solved by the Newton-Raphson algorithm. We then investigate thesampling properties of the resulting estimate of fixed effects. We show that the resulting estimate enjoysmodel selection oracle properties, indicating that asymptotically the proposed approachcan discover the subset of significant predictors. After finishing the two-stage penalized procedure, the best linear mixed model cansubsequently be determined and be applied to handle correlated data in a number of fields.To illustrate the performance of the proposed method, numerous simulation studies have been conducted. The simulation results demonstrate that the proposed technique is quite efficient in selecting the best covariates and random covariance structure in linear mixed models and outperforms the existing selection methodologies in general. We finally apply the method to two real data applications for further examining its effectiveness in mixed model selection. 2016-07-18 English text Bowling Green State University / OhioLINK http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1466633921 http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1466633921 unrestricted This thesis or dissertation is protected by copyright: all rights reserved. It may not be copied or redistributed beyond the terms of applicable copyright laws. |