Regression analyses with clustered time-to-event data when numbers of event are small

碩士 === 國立陽明大學 === 公共衛生研究所 === 101 === Background: In public health and clinical studies, parameter estimates in Cox regression models are usually unreliable when event is rare or when follow-up time is not longer enough. Previous studies have suggested that the event per variable (EPV) should be mor...

Full description

Bibliographic Details
Main Authors: Feng-Shiang Cheng, 鄭鳳翔
Other Authors: I-Feng Lin
Format: Others
Language:zh-TW
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/70747709298584086585
id ndltd-TW-101YM005058039
record_format oai_dc
spelling ndltd-TW-101YM0050580392016-03-18T04:41:53Z http://ndltd.ncl.edu.tw/handle/70747709298584086585 Regression analyses with clustered time-to-event data when numbers of event are small 當事件數稀少時群聚存活資料之迴歸分析模式比較 Feng-Shiang Cheng 鄭鳳翔 碩士 國立陽明大學 公共衛生研究所 101 Background: In public health and clinical studies, parameter estimates in Cox regression models are usually unreliable when event is rare or when follow-up time is not longer enough. Previous studies have suggested that the event per variable (EPV) should be more than 10 when data are not independent. When data are correlated, e.g. cluster sampling, Cox models with robust sandwich variance estimates (Lee, Wei, &; Amato, 1989) and Frailty model are commonly used. How EPVs affect the parameter estimates in Cox model with clustered data remains to be explored. Objectives:The aim of this study was to evaluate the accuracy of the parameter estimates of cox models with clustered data when EPV is small. Methods:Simulation studies have been conducted with different numbers of event per variable, different number of clusters and cluster sizes, and varying between-cluster variations. This simulation study compared the estimates by different estimatos in Cox models: Cox model estimated by partial likelihood、Cox model estimated by Firth’s Penalized likelihood、 Cox model estimated by robust variance estimator、and Cox model estimated with random effect in cluster data. Results and Conclusions: Bias: When data were clustered with small EPV (EPV=5), the “Frailty” estimator method were less biased in general , no matter what the ratio of numbers of cluster to cluster size were, for both categorical and continuous independent variables. In the extreme case when number of cluster (m) is far smaller than cluster size (n), m=5 and n=100, or when the between-cluster variation was small, the “Firth” method is least biased. Efficiency:When EPV is small and data are clustered, for categorical variables, the “Robust” method usually underestimate the standard error when numbers of cluster larger than cluster size. On the other hand, the “Frailty” and the “Standard” method usually overestimate the standard error. While numbers of cluster were much smaller than cluster size,the both “Robust” and “ Frailty” method has underestimated the standard error, but ”Standard” method has overestimated it. Coverage rate of 95% confidence intervals:The 95% coverage rate by “Robust” method was in generally less than 95%, i.e., there were greater chances that the results of hypothesis testing for the parameter would be statistically significant than it should have been. Converge rate:The ”Frailty” method has worse converge rate while the converge rates by the ”Robust” 、the ”Standard”, and the ”Firth” methods were almos reach 100%. I-Feng Lin 林逸芬 2013 學位論文 ; thesis 35 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立陽明大學 === 公共衛生研究所 === 101 === Background: In public health and clinical studies, parameter estimates in Cox regression models are usually unreliable when event is rare or when follow-up time is not longer enough. Previous studies have suggested that the event per variable (EPV) should be more than 10 when data are not independent. When data are correlated, e.g. cluster sampling, Cox models with robust sandwich variance estimates (Lee, Wei, &; Amato, 1989) and Frailty model are commonly used. How EPVs affect the parameter estimates in Cox model with clustered data remains to be explored. Objectives:The aim of this study was to evaluate the accuracy of the parameter estimates of cox models with clustered data when EPV is small. Methods:Simulation studies have been conducted with different numbers of event per variable, different number of clusters and cluster sizes, and varying between-cluster variations. This simulation study compared the estimates by different estimatos in Cox models: Cox model estimated by partial likelihood、Cox model estimated by Firth’s Penalized likelihood、 Cox model estimated by robust variance estimator、and Cox model estimated with random effect in cluster data. Results and Conclusions: Bias: When data were clustered with small EPV (EPV=5), the “Frailty” estimator method were less biased in general , no matter what the ratio of numbers of cluster to cluster size were, for both categorical and continuous independent variables. In the extreme case when number of cluster (m) is far smaller than cluster size (n), m=5 and n=100, or when the between-cluster variation was small, the “Firth” method is least biased. Efficiency:When EPV is small and data are clustered, for categorical variables, the “Robust” method usually underestimate the standard error when numbers of cluster larger than cluster size. On the other hand, the “Frailty” and the “Standard” method usually overestimate the standard error. While numbers of cluster were much smaller than cluster size,the both “Robust” and “ Frailty” method has underestimated the standard error, but ”Standard” method has overestimated it. Coverage rate of 95% confidence intervals:The 95% coverage rate by “Robust” method was in generally less than 95%, i.e., there were greater chances that the results of hypothesis testing for the parameter would be statistically significant than it should have been. Converge rate:The ”Frailty” method has worse converge rate while the converge rates by the ”Robust” 、the ”Standard”, and the ”Firth” methods were almos reach 100%.
author2 I-Feng Lin
author_facet I-Feng Lin
Feng-Shiang Cheng
鄭鳳翔
author Feng-Shiang Cheng
鄭鳳翔
spellingShingle Feng-Shiang Cheng
鄭鳳翔
Regression analyses with clustered time-to-event data when numbers of event are small
author_sort Feng-Shiang Cheng
title Regression analyses with clustered time-to-event data when numbers of event are small
title_short Regression analyses with clustered time-to-event data when numbers of event are small
title_full Regression analyses with clustered time-to-event data when numbers of event are small
title_fullStr Regression analyses with clustered time-to-event data when numbers of event are small
title_full_unstemmed Regression analyses with clustered time-to-event data when numbers of event are small
title_sort regression analyses with clustered time-to-event data when numbers of event are small
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/70747709298584086585
work_keys_str_mv AT fengshiangcheng regressionanalyseswithclusteredtimetoeventdatawhennumbersofeventaresmall
AT zhèngfèngxiáng regressionanalyseswithclusteredtimetoeventdatawhennumbersofeventaresmall
AT fengshiangcheng dāngshìjiànshùxīshǎoshíqúnjùcúnhuózīliàozhīhuíguīfēnxīmóshìbǐjiào
AT zhèngfèngxiáng dāngshìjiànshùxīshǎoshíqúnjùcúnhuózīliàozhīhuíguīfēnxīmóshìbǐjiào
_version_ 1718207651590963200