A preliminary study of variable selection in penalized logistic regression with rare events data

碩士 === 國立成功大學 === 統計學系 === 107 === It's well known that the accuracy of MLE of the regression coefficient in logistic regression model is seriously affected by rare events. Less attention is given to the performance of variable selection in logistic regression with rare events. Therefore, this...

Full description

Bibliographic Details
Main Authors: Ding-HuangLin, 林鼎晃
Other Authors: Yun-Chan Chi
Format: Others
Language:zh-TW
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/2g62ac
id ndltd-TW-107NCKU5337022
record_format oai_dc
spelling ndltd-TW-107NCKU53370222019-10-26T06:24:15Z http://ndltd.ncl.edu.tw/handle/2g62ac A preliminary study of variable selection in penalized logistic regression with rare events data 在稀少事件下邏輯式迴歸於三種懲罰項的變數篩選能力之初步探討 Ding-HuangLin 林鼎晃 碩士 國立成功大學 統計學系 107 It's well known that the accuracy of MLE of the regression coefficient in logistic regression model is seriously affected by rare events. Less attention is given to the performance of variable selection in logistic regression with rare events. Therefore, this thesis studies the performance of three variable selection methods, LASSO (Least Absolute Shrinkage and Selection Operator), SCAD (Smoothly Clipper Absolute Deviation), and Adaptive LASSO, when event rate is low and the number of explanatory variables is much larger than sample sizes. A simulation study is conducted to compare the accuracy in selecting important explanatory variables of logistic regression model. Based on limited simulation scenarios, when event rate is as low as 0.05, the simulation results recommended using Adaptive LASSO to select important explanatory variables. Consequently, Adaptive LASSO is recommended for variable selection and prediction with rare events data. Yun-Chan Chi 嵇允嬋 2019 學位論文 ; thesis 33 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立成功大學 === 統計學系 === 107 === It's well known that the accuracy of MLE of the regression coefficient in logistic regression model is seriously affected by rare events. Less attention is given to the performance of variable selection in logistic regression with rare events. Therefore, this thesis studies the performance of three variable selection methods, LASSO (Least Absolute Shrinkage and Selection Operator), SCAD (Smoothly Clipper Absolute Deviation), and Adaptive LASSO, when event rate is low and the number of explanatory variables is much larger than sample sizes. A simulation study is conducted to compare the accuracy in selecting important explanatory variables of logistic regression model. Based on limited simulation scenarios, when event rate is as low as 0.05, the simulation results recommended using Adaptive LASSO to select important explanatory variables. Consequently, Adaptive LASSO is recommended for variable selection and prediction with rare events data.
author2 Yun-Chan Chi
author_facet Yun-Chan Chi
Ding-HuangLin
林鼎晃
author Ding-HuangLin
林鼎晃
spellingShingle Ding-HuangLin
林鼎晃
A preliminary study of variable selection in penalized logistic regression with rare events data
author_sort Ding-HuangLin
title A preliminary study of variable selection in penalized logistic regression with rare events data
title_short A preliminary study of variable selection in penalized logistic regression with rare events data
title_full A preliminary study of variable selection in penalized logistic regression with rare events data
title_fullStr A preliminary study of variable selection in penalized logistic regression with rare events data
title_full_unstemmed A preliminary study of variable selection in penalized logistic regression with rare events data
title_sort preliminary study of variable selection in penalized logistic regression with rare events data
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/2g62ac
work_keys_str_mv AT dinghuanglin apreliminarystudyofvariableselectioninpenalizedlogisticregressionwithrareeventsdata
AT líndǐnghuǎng apreliminarystudyofvariableselectioninpenalizedlogisticregressionwithrareeventsdata
AT dinghuanglin zàixīshǎoshìjiànxiàluójíshìhuíguīyúsānzhǒngchéngfáxiàngdebiànshùshāixuǎnnénglìzhīchūbùtàntǎo
AT líndǐnghuǎng zàixīshǎoshìjiànxiàluójíshìhuíguīyúsānzhǒngchéngfáxiàngdebiànshùshāixuǎnnénglìzhīchūbùtàntǎo
AT dinghuanglin preliminarystudyofvariableselectioninpenalizedlogisticregressionwithrareeventsdata
AT líndǐnghuǎng preliminarystudyofvariableselectioninpenalizedlogisticregressionwithrareeventsdata
_version_ 1719279271843725312