Meta Analysis and Meta Regression of a Measure of Discrimination Used in Prognostic Modeling
In this paper we are interested in predicting death with the underlying cause of coronary heart disease (CHD). There are two prognostic modeling methods used to predict CHD: the logistic model and the proportional hazard model. For this paper we consider the logistic model. The dataset used is the D...
Other Authors: | |
---|---|
Format: | Others |
Language: | English English |
Published: |
Florida State University
|
Subjects: | |
Online Access: | http://purl.flvc.org/fsu/fd/FSU_migr_etd-7580 |
id |
ndltd-fsu.edu-oai-fsu.digital.flvc.org-fsu_183880 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-fsu.edu-oai-fsu.digital.flvc.org-fsu_1838802020-06-16T03:08:52Z Meta Analysis and Meta Regression of a Measure of Discrimination Used in Prognostic Modeling Rivera, Gretchen L. (authoraut) McGee, Daniel (professor directing thesis) Hurt, Myra (university representative) Niu, Xufeng (committee member) Sinha, Debajyoti (committee member) Department of Statistics (degree granting department) Florida State University (degree granting institution) Text text Florida State University Florida State University English eng 1 online resource computer application/pdf In this paper we are interested in predicting death with the underlying cause of coronary heart disease (CHD). There are two prognostic modeling methods used to predict CHD: the logistic model and the proportional hazard model. For this paper we consider the logistic model. The dataset used is the Diverse Populations Collaboration (DPC) dataset which includes 28 studies. The DPC dataset has epidemiological results from investigation conducted in different populations around the world. For our analysis we include those individuals who are 17 years old or older. The predictors are: age, diabetes, total serum cholesterol (mg/dl), high density lipoprotein (mg/dl), systolic blood pressure (mmHg) and if the participant is a current cigarette smoker. There is a natural grouping within the studies such as gender, rural or urban area and race. Based on these strata we have 84 cohort groups. Our main interest is to evaluate how well the prognostic model discriminates. For this, we used the area under the Receiver Operating Characteristic (ROC) curve. The main idea of the ROC curve is that a set of subject is known to belong to one of two classes (signal or noise group). Then an assignment procedure assigns each object to a class on the basis of information observed. The assignment procedure is not perfect: sometimes an object is misclassified. We want to evaluate the quality of performance of this procedure, for this we used the Area under the ROC curve (AUROC). The AUROC varies from 0.5 (no apparent accuracy) to 1.0 (perfect accuracy). For each logistic model we found the AUROC and its standard error (SE). We used Meta-analysis to summarize the estimated AUROCs and to evaluate if there is heterogeneity in our estimates. To evaluate the existence of significant heterogeneity we used the Q statistic. Since heterogeneity was found in our study we compare seven different methods for estimating τ2 (between study variance). We conclude by examining whether differences in study characteristics explained the heterogeneity in the values of the AUROC. A Dissertation submitted to the Department of Statistics in partial fulfillment of the requirements for the degree of Doctor of Philosophy. Spring Semester, 2013. March 19, 2013. Includes bibliographical references. Daniel McGee, Professor Directing Thesis; Myra Hurt, University Representative; Xufeng Niu, Committee Member; Debajyoti Sinha, Committee Member. Statistics FSU_migr_etd-7580 http://purl.flvc.org/fsu/fd/FSU_migr_etd-7580 This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). The copyright in theses and dissertations completed at Florida State University is held by the students who author them. http://diginole.lib.fsu.edu/islandora/object/fsu%3A183880/datastream/TN/view/Meta%20Analysis%20and%20Meta%20Regression%20of%20a%20Measure%20of%20Discrimination%20Used%20in%20Prognostic%20Modeling.jpg |
collection |
NDLTD |
language |
English English |
format |
Others
|
sources |
NDLTD |
topic |
Statistics |
spellingShingle |
Statistics Meta Analysis and Meta Regression of a Measure of Discrimination Used in Prognostic Modeling |
description |
In this paper we are interested in predicting death with the underlying cause of coronary heart disease (CHD). There are two prognostic modeling methods used to predict CHD: the logistic model and the proportional hazard model. For this paper we consider the logistic model. The dataset used is the Diverse Populations Collaboration (DPC) dataset which includes 28 studies. The DPC dataset has epidemiological results from investigation conducted in different populations around the world. For our analysis we include those individuals who are 17 years old or older. The predictors are: age, diabetes, total serum cholesterol (mg/dl), high density lipoprotein (mg/dl), systolic blood pressure (mmHg) and if the participant is a current cigarette smoker. There is a natural grouping within the studies such as gender, rural or urban area and race. Based on these strata we have 84 cohort groups. Our main interest is to evaluate how well the prognostic model discriminates. For this, we used the area under the Receiver Operating Characteristic (ROC) curve. The main idea of the ROC curve is that a set of subject is known to belong to one of two classes (signal or noise group). Then an assignment procedure assigns each object to a class on the basis of information observed. The assignment procedure is not perfect: sometimes an object is misclassified. We want to evaluate the quality of performance of this procedure, for this we used the Area under the ROC curve (AUROC). The AUROC varies from 0.5 (no apparent accuracy) to 1.0 (perfect accuracy). For each logistic model we found the AUROC and its standard error (SE). We used Meta-analysis to summarize the estimated AUROCs and to evaluate if there is heterogeneity in our estimates. To evaluate the existence of significant heterogeneity we used the Q statistic. Since heterogeneity was found in our study we compare seven different methods for estimating τ2 (between study variance). We conclude by examining whether differences in study characteristics explained the heterogeneity in the values of the AUROC. === A Dissertation submitted to the Department of Statistics in partial fulfillment of the requirements for the degree of Doctor of Philosophy. === Spring Semester, 2013. === March 19, 2013. === Includes bibliographical references. === Daniel McGee, Professor Directing Thesis; Myra Hurt, University Representative; Xufeng Niu, Committee Member; Debajyoti Sinha, Committee Member. |
author2 |
Rivera, Gretchen L. (authoraut) |
author_facet |
Rivera, Gretchen L. (authoraut) |
title |
Meta Analysis and Meta Regression of a Measure of Discrimination Used in Prognostic Modeling |
title_short |
Meta Analysis and Meta Regression of a Measure of Discrimination Used in Prognostic Modeling |
title_full |
Meta Analysis and Meta Regression of a Measure of Discrimination Used in Prognostic Modeling |
title_fullStr |
Meta Analysis and Meta Regression of a Measure of Discrimination Used in Prognostic Modeling |
title_full_unstemmed |
Meta Analysis and Meta Regression of a Measure of Discrimination Used in Prognostic Modeling |
title_sort |
meta analysis and meta regression of a measure of discrimination used in prognostic modeling |
publisher |
Florida State University |
url |
http://purl.flvc.org/fsu/fd/FSU_migr_etd-7580 |
_version_ |
1719320105621389312 |