Regression analyses with clustered time-to-event data when numbers of event are small
碩士 === 國立陽明大學 === 公共衛生研究所 === 101 === Background: In public health and clinical studies, parameter estimates in Cox regression models are usually unreliable when event is rare or when follow-up time is not longer enough. Previous studies have suggested that the event per variable (EPV) should be mor...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2013
|
Online Access: | http://ndltd.ncl.edu.tw/handle/70747709298584086585 |
id |
ndltd-TW-101YM005058039 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-101YM0050580392016-03-18T04:41:53Z http://ndltd.ncl.edu.tw/handle/70747709298584086585 Regression analyses with clustered time-to-event data when numbers of event are small 當事件數稀少時群聚存活資料之迴歸分析模式比較 Feng-Shiang Cheng 鄭鳳翔 碩士 國立陽明大學 公共衛生研究所 101 Background: In public health and clinical studies, parameter estimates in Cox regression models are usually unreliable when event is rare or when follow-up time is not longer enough. Previous studies have suggested that the event per variable (EPV) should be more than 10 when data are not independent. When data are correlated, e.g. cluster sampling, Cox models with robust sandwich variance estimates (Lee, Wei, &; Amato, 1989) and Frailty model are commonly used. How EPVs affect the parameter estimates in Cox model with clustered data remains to be explored. Objectives:The aim of this study was to evaluate the accuracy of the parameter estimates of cox models with clustered data when EPV is small. Methods:Simulation studies have been conducted with different numbers of event per variable, different number of clusters and cluster sizes, and varying between-cluster variations. This simulation study compared the estimates by different estimatos in Cox models: Cox model estimated by partial likelihood、Cox model estimated by Firth’s Penalized likelihood、 Cox model estimated by robust variance estimator、and Cox model estimated with random effect in cluster data. Results and Conclusions: Bias: When data were clustered with small EPV (EPV=5), the “Frailty” estimator method were less biased in general , no matter what the ratio of numbers of cluster to cluster size were, for both categorical and continuous independent variables. In the extreme case when number of cluster (m) is far smaller than cluster size (n), m=5 and n=100, or when the between-cluster variation was small, the “Firth” method is least biased. Efficiency:When EPV is small and data are clustered, for categorical variables, the “Robust” method usually underestimate the standard error when numbers of cluster larger than cluster size. On the other hand, the “Frailty” and the “Standard” method usually overestimate the standard error. While numbers of cluster were much smaller than cluster size,the both “Robust” and “ Frailty” method has underestimated the standard error, but ”Standard” method has overestimated it. Coverage rate of 95% confidence intervals:The 95% coverage rate by “Robust” method was in generally less than 95%, i.e., there were greater chances that the results of hypothesis testing for the parameter would be statistically significant than it should have been. Converge rate:The ”Frailty” method has worse converge rate while the converge rates by the ”Robust” 、the ”Standard”, and the ”Firth” methods were almos reach 100%. I-Feng Lin 林逸芬 2013 學位論文 ; thesis 35 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立陽明大學 === 公共衛生研究所 === 101 === Background: In public health and clinical studies, parameter estimates in Cox regression models are usually unreliable when event is rare or when follow-up time is not longer enough. Previous studies have suggested that the event per variable (EPV) should be more than 10 when data are not independent. When data are correlated, e.g. cluster sampling, Cox models with robust sandwich variance estimates (Lee, Wei, &; Amato, 1989) and Frailty model are commonly used. How EPVs affect the parameter estimates in Cox model with clustered data remains to be explored.
Objectives:The aim of this study was to evaluate the accuracy of the parameter estimates of cox models with clustered data when EPV is small.
Methods:Simulation studies have been conducted with different numbers of event per variable, different number of clusters and cluster sizes, and varying between-cluster variations. This simulation study compared the estimates by different estimatos in Cox models: Cox model estimated by partial likelihood、Cox model estimated by Firth’s Penalized likelihood、 Cox model estimated by robust variance estimator、and Cox model estimated with random effect in cluster data.
Results and Conclusions:
Bias: When data were clustered with small EPV (EPV=5), the “Frailty” estimator method were less biased in general , no matter what the ratio of numbers of cluster to cluster size were, for both categorical and continuous independent variables. In the extreme case when number of cluster (m) is far smaller than cluster size (n), m=5 and n=100, or when the between-cluster variation was small, the “Firth” method is least biased.
Efficiency:When EPV is small and data are clustered, for categorical variables, the “Robust” method usually underestimate the standard error when numbers of cluster larger than cluster size. On the other hand, the “Frailty” and the “Standard” method usually overestimate the standard error. While numbers of cluster were much smaller than cluster size,the both “Robust” and “ Frailty” method has underestimated the standard error, but ”Standard” method has overestimated it.
Coverage rate of 95% confidence intervals:The 95% coverage rate by “Robust” method was in generally less than 95%, i.e., there were greater chances that the results of hypothesis testing for the parameter would be statistically significant than it should have been.
Converge rate:The ”Frailty” method has worse converge rate while the converge rates by the ”Robust” 、the ”Standard”, and the ”Firth” methods were almos reach 100%.
|
author2 |
I-Feng Lin |
author_facet |
I-Feng Lin Feng-Shiang Cheng 鄭鳳翔 |
author |
Feng-Shiang Cheng 鄭鳳翔 |
spellingShingle |
Feng-Shiang Cheng 鄭鳳翔 Regression analyses with clustered time-to-event data when numbers of event are small |
author_sort |
Feng-Shiang Cheng |
title |
Regression analyses with clustered time-to-event data when numbers of event are small |
title_short |
Regression analyses with clustered time-to-event data when numbers of event are small |
title_full |
Regression analyses with clustered time-to-event data when numbers of event are small |
title_fullStr |
Regression analyses with clustered time-to-event data when numbers of event are small |
title_full_unstemmed |
Regression analyses with clustered time-to-event data when numbers of event are small |
title_sort |
regression analyses with clustered time-to-event data when numbers of event are small |
publishDate |
2013 |
url |
http://ndltd.ncl.edu.tw/handle/70747709298584086585 |
work_keys_str_mv |
AT fengshiangcheng regressionanalyseswithclusteredtimetoeventdatawhennumbersofeventaresmall AT zhèngfèngxiáng regressionanalyseswithclusteredtimetoeventdatawhennumbersofeventaresmall AT fengshiangcheng dāngshìjiànshùxīshǎoshíqúnjùcúnhuózīliàozhīhuíguīfēnxīmóshìbǐjiào AT zhèngfèngxiáng dāngshìjiànshùxīshǎoshíqúnjùcúnhuózīliàozhīhuíguīfēnxīmóshìbǐjiào |
_version_ |
1718207651590963200 |