SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models

We revisit sure independence screening procedures for variable selection in generalized linear models and the Cox proportional hazards model. Through the publicly available R package SIS, we provide a unified environment to carry out variable selection using iterative sure independence screening (IS...

Full description

Bibliographic Details
Main Authors: Diego Franco Saldana, Yang Feng
Format: Article
Language:English
Published: Foundation for Open Access Statistics 2018-02-01
Series:Journal of Statistical Software
Subjects:
Online Access:https://www.jstatsoft.org/index.php/jss/article/view/3378
id doaj-bcfd41b5e8b64cdd9a1b504541f0bf7d
record_format Article
spelling doaj-bcfd41b5e8b64cdd9a1b504541f0bf7d2020-11-24T22:29:48ZengFoundation for Open Access StatisticsJournal of Statistical Software1548-76602018-02-0183112510.18637/jss.v083.i021183SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical ModelsDiego Franco SaldanaYang FengWe revisit sure independence screening procedures for variable selection in generalized linear models and the Cox proportional hazards model. Through the publicly available R package SIS, we provide a unified environment to carry out variable selection using iterative sure independence screening (ISIS) and all of its variants. For the regularization steps in the ISIS recruiting process, available penalties include the LASSO, SCAD, and MCP while the implemented variants for the screening steps are sample splitting, data-driven thresholding, and combinations thereof. Performance of these feature selection techniques is investigated by means of real and simulated data sets, where we find considerable improvements in terms of model selection and computational time between our algorithms and traditional penalized pseudo-likelihood methods applied directly to the full set of covariates.https://www.jstatsoft.org/index.php/jss/article/view/3378Cox modelgeneralized linear modelspenalized likelihood estimationsparsitysure independence screeningvariable selection
collection DOAJ
language English
format Article
sources DOAJ
author Diego Franco Saldana
Yang Feng
spellingShingle Diego Franco Saldana
Yang Feng
SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models
Journal of Statistical Software
Cox model
generalized linear models
penalized likelihood estimation
sparsity
sure independence screening
variable selection
author_facet Diego Franco Saldana
Yang Feng
author_sort Diego Franco Saldana
title SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models
title_short SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models
title_full SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models
title_fullStr SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models
title_full_unstemmed SIS: An R Package for Sure Independence Screening in Ultrahigh-Dimensional Statistical Models
title_sort sis: an r package for sure independence screening in ultrahigh-dimensional statistical models
publisher Foundation for Open Access Statistics
series Journal of Statistical Software
issn 1548-7660
publishDate 2018-02-01
description We revisit sure independence screening procedures for variable selection in generalized linear models and the Cox proportional hazards model. Through the publicly available R package SIS, we provide a unified environment to carry out variable selection using iterative sure independence screening (ISIS) and all of its variants. For the regularization steps in the ISIS recruiting process, available penalties include the LASSO, SCAD, and MCP while the implemented variants for the screening steps are sample splitting, data-driven thresholding, and combinations thereof. Performance of these feature selection techniques is investigated by means of real and simulated data sets, where we find considerable improvements in terms of model selection and computational time between our algorithms and traditional penalized pseudo-likelihood methods applied directly to the full set of covariates.
topic Cox model
generalized linear models
penalized likelihood estimation
sparsity
sure independence screening
variable selection
url https://www.jstatsoft.org/index.php/jss/article/view/3378
work_keys_str_mv AT diegofrancosaldana sisanrpackageforsureindependencescreeninginultrahighdimensionalstatisticalmodels
AT yangfeng sisanrpackageforsureindependencescreeninginultrahighdimensionalstatisticalmodels
_version_ 1725743173389516800