A preliminary study of variable selection in penalized logistic regression with rare events data
碩士 === 國立成功大學 === 統計學系 === 107 === It's well known that the accuracy of MLE of the regression coefficient in logistic regression model is seriously affected by rare events. Less attention is given to the performance of variable selection in logistic regression with rare events. Therefore, this...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2019
|
Online Access: | http://ndltd.ncl.edu.tw/handle/2g62ac |
id |
ndltd-TW-107NCKU5337022 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-107NCKU53370222019-10-26T06:24:15Z http://ndltd.ncl.edu.tw/handle/2g62ac A preliminary study of variable selection in penalized logistic regression with rare events data 在稀少事件下邏輯式迴歸於三種懲罰項的變數篩選能力之初步探討 Ding-HuangLin 林鼎晃 碩士 國立成功大學 統計學系 107 It's well known that the accuracy of MLE of the regression coefficient in logistic regression model is seriously affected by rare events. Less attention is given to the performance of variable selection in logistic regression with rare events. Therefore, this thesis studies the performance of three variable selection methods, LASSO (Least Absolute Shrinkage and Selection Operator), SCAD (Smoothly Clipper Absolute Deviation), and Adaptive LASSO, when event rate is low and the number of explanatory variables is much larger than sample sizes. A simulation study is conducted to compare the accuracy in selecting important explanatory variables of logistic regression model. Based on limited simulation scenarios, when event rate is as low as 0.05, the simulation results recommended using Adaptive LASSO to select important explanatory variables. Consequently, Adaptive LASSO is recommended for variable selection and prediction with rare events data. Yun-Chan Chi 嵇允嬋 2019 學位論文 ; thesis 33 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立成功大學 === 統計學系 === 107 === It's well known that the accuracy of MLE of the regression coefficient in logistic regression model is seriously affected by rare events. Less attention is given to the performance of variable selection in logistic regression with rare events. Therefore, this thesis studies the performance of three variable selection methods, LASSO (Least Absolute Shrinkage and Selection Operator), SCAD (Smoothly Clipper Absolute Deviation), and Adaptive LASSO, when event rate is low and the number of explanatory variables is much larger than sample sizes.
A simulation study is conducted to compare the accuracy in selecting important explanatory variables of logistic regression model. Based on limited simulation scenarios, when event rate is as low as 0.05, the simulation results recommended using Adaptive LASSO to select important explanatory variables. Consequently, Adaptive LASSO is recommended for variable selection and prediction with rare events data.
|
author2 |
Yun-Chan Chi |
author_facet |
Yun-Chan Chi Ding-HuangLin 林鼎晃 |
author |
Ding-HuangLin 林鼎晃 |
spellingShingle |
Ding-HuangLin 林鼎晃 A preliminary study of variable selection in penalized logistic regression with rare events data |
author_sort |
Ding-HuangLin |
title |
A preliminary study of variable selection in penalized logistic regression with rare events data |
title_short |
A preliminary study of variable selection in penalized logistic regression with rare events data |
title_full |
A preliminary study of variable selection in penalized logistic regression with rare events data |
title_fullStr |
A preliminary study of variable selection in penalized logistic regression with rare events data |
title_full_unstemmed |
A preliminary study of variable selection in penalized logistic regression with rare events data |
title_sort |
preliminary study of variable selection in penalized logistic regression with rare events data |
publishDate |
2019 |
url |
http://ndltd.ncl.edu.tw/handle/2g62ac |
work_keys_str_mv |
AT dinghuanglin apreliminarystudyofvariableselectioninpenalizedlogisticregressionwithrareeventsdata AT líndǐnghuǎng apreliminarystudyofvariableselectioninpenalizedlogisticregressionwithrareeventsdata AT dinghuanglin zàixīshǎoshìjiànxiàluójíshìhuíguīyúsānzhǒngchéngfáxiàngdebiànshùshāixuǎnnénglìzhīchūbùtàntǎo AT líndǐnghuǎng zàixīshǎoshìjiànxiàluójíshìhuíguīyúsānzhǒngchéngfáxiàngdebiànshùshāixuǎnnénglìzhīchūbùtàntǎo AT dinghuanglin preliminarystudyofvariableselectioninpenalizedlogisticregressionwithrareeventsdata AT líndǐnghuǎng preliminarystudyofvariableselectioninpenalizedlogisticregressionwithrareeventsdata |
_version_ |
1719279271843725312 |