SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models
We revisit sure independence screening procedures for variable selection in generalized linear models and the Cox proportional hazards model. Through the publicly available R package SIS, we provide a unified environment to carry out variable selection using iterative sure independence screening (IS...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Foundation for Open Access Statistics
2018-02-01
|
Series: | Journal of Statistical Software |
Subjects: | |
Online Access: | https://www.jstatsoft.org/index.php/jss/article/view/3378 |
id |
doaj-bcfd41b5e8b64cdd9a1b504541f0bf7d |
---|---|
record_format |
Article |
spelling |
doaj-bcfd41b5e8b64cdd9a1b504541f0bf7d2020-11-24T22:29:48ZengFoundation for Open Access StatisticsJournal of Statistical Software1548-76602018-02-0183112510.18637/jss.v083.i021183SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical ModelsDiego Franco SaldanaYang FengWe revisit sure independence screening procedures for variable selection in generalized linear models and the Cox proportional hazards model. Through the publicly available R package SIS, we provide a unified environment to carry out variable selection using iterative sure independence screening (ISIS) and all of its variants. For the regularization steps in the ISIS recruiting process, available penalties include the LASSO, SCAD, and MCP while the implemented variants for the screening steps are sample splitting, data-driven thresholding, and combinations thereof. Performance of these feature selection techniques is investigated by means of real and simulated data sets, where we find considerable improvements in terms of model selection and computational time between our algorithms and traditional penalized pseudo-likelihood methods applied directly to the full set of covariates.https://www.jstatsoft.org/index.php/jss/article/view/3378Cox modelgeneralized linear modelspenalized likelihood estimationsparsitysure independence screeningvariable selection |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Diego Franco Saldana Yang Feng |
spellingShingle |
Diego Franco Saldana Yang Feng SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models Journal of Statistical Software Cox model generalized linear models penalized likelihood estimation sparsity sure independence screening variable selection |
author_facet |
Diego Franco Saldana Yang Feng |
author_sort |
Diego Franco Saldana |
title |
SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models |
title_short |
SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models |
title_full |
SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models |
title_fullStr |
SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models |
title_full_unstemmed |
SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models |
title_sort |
sis: an r package for sure independence screening in ultrahigh-dimensional statistical models |
publisher |
Foundation for Open Access Statistics |
series |
Journal of Statistical Software |
issn |
1548-7660 |
publishDate |
2018-02-01 |
description |
We revisit sure independence screening procedures for variable selection in generalized linear models and the Cox proportional hazards model. Through the publicly available R package SIS, we provide a unified environment to carry out variable selection using iterative sure independence screening (ISIS) and all of its variants. For the regularization steps in the ISIS recruiting process, available penalties include the LASSO, SCAD, and MCP while the implemented variants for the screening steps are sample splitting, data-driven thresholding, and combinations thereof. Performance of these feature selection techniques is investigated by means of real and simulated data sets, where we find considerable improvements in terms of model selection and computational time between our algorithms and traditional penalized pseudo-likelihood methods applied directly to the full set of covariates. |
topic |
Cox model generalized linear models penalized likelihood estimation sparsity sure independence screening variable selection |
url |
https://www.jstatsoft.org/index.php/jss/article/view/3378 |
work_keys_str_mv |
AT diegofrancosaldana sisanrpackageforsureindependencescreeninginultrahighdimensionalstatisticalmodels AT yangfeng sisanrpackageforsureindependencescreeninginultrahighdimensionalstatisticalmodels |
_version_ |
1725743173389516800 |