Completeness of rheumatoid arthritis prevalence estimates from administrative health data: comparison of capture-recapture models

Rheumatoid arthritis (RA) is a chronic disease characterized by an overactive immune system and joint inflammation. Population-based administrative health data (AHD) are widely used for RA outcomes research and surveillance. However, AHD may not completely capture all cases of RA in the population....


Bibliographic Details
Main Author: Nie, Yao
Other Authors: Lix, Lisa (Community Health Sciences)
Published: 2014
Subjects: Capture-Recapture Models; Monte Carlo Simulation; Prevalence; Rheumatoid Arthritis
Online Access: http://hdl.handle.net/1993/23679
id ndltd-MANITOBA-oai-mspace.lib.umanitoba.ca-1993-23679
record_format oai_dc
spelling ndltd-MANITOBA-oai-mspace.lib.umanitoba.ca-1993-23679 2015-07-09T03:48:34Z
Completeness of rheumatoid arthritis prevalence estimates from administrative health data: comparison of capture-recapture models
Nie, Yao
Lix, Lisa (Community Health Sciences)
Jiang, Depeng (Community Health Sciences)
Muhajarine, Nazeem (University of Saskatchewan)
Shiff, Natalie (University of Saskatchewan)
Capture-Recapture Models
Monte Carlo Simulation
Prevalence
Rheumatoid Arthritis
Rheumatoid arthritis (RA) is a chronic disease characterized by an overactive immune system and joint inflammation. Population-based administrative health data (AHD) are widely used for RA outcomes research and surveillance. However, AHD may not completely capture all cases of RA in the population. Capture-recapture (CR) methods have been proposed to describe the completeness of AHD for estimating disease population size, but AHD may not conform to the assumptions that underlie CR models. A Monte Carlo simulation study was used to investigate the effects of violations of the assumptions for two-source CR methods: dependence between data sources and heterogeneity of capture probabilities. We compared the Chapman estimator and an estimator based on the multinomial logistic regression model (MLRM) to study relative bias (RB), coverage probability (CP) of 95% confidence intervals, width of 95% confidence intervals (WCI), and root-mean-square error (RMSE) in prevalence estimates. The effects of misspecification of the MLRM were also investigated. In addition, the Chapman and MLRM estimators were used to estimate RA prevalence using AHD from Saskatchewan, Canada. CR methods consistently underestimated population size when the assumptions were violated. The population size estimates from the two estimators did not differ substantially, except in their RMSE values. Parameter estimates became biased when the MLRM was misspecified, but there was little impact on population size estimates. In conclusion, CR methods are recommended to reduce bias in prevalence estimates based on AHD. Because these methods may be sensitive to assumption violations, researchers should consider potential dependence between data sources. In addition, sufficient overlap in the cases captured by each data source (e.g., 50% of the cases are captured by both data sources) or balanced capture probability in each data source is needed to effectively implement these methods. Researchers who estimate population size using CR methods in AHD should favour the MLRM estimator over the Chapman estimator.
2014-07-03T14:37:31Z
2014-07-03T14:37:31Z
2014-07-03
http://hdl.handle.net/1993/23679
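The abstract names the Chapman estimator and reports relative bias and RMSE from a Monte Carlo study, but this record does not reproduce the formulas or the simulation design. The following is a minimal sketch of the standard two-source Chapman estimator with a commonly used variance estimate, plus a toy simulation in which the second source's capture probability depends on capture by the first. The function names, parameter names, and all numeric settings are illustrative assumptions, not taken from the thesis.

```python
import math
import random


def chapman_estimate(n1, n2, m):
    """Chapman's two-source estimate of total population size.

    n1, n2: number of cases found by source 1 and by source 2.
    m: number of cases found by both sources.
    """
    return (n1 + 1) * (n2 + 1) / (m + 1) - 1


def chapman_variance(n1, n2, m):
    """Commonly used variance estimate for the Chapman estimator."""
    numerator = (n1 + 1) * (n2 + 1) * (n1 - m) * (n2 - m)
    return numerator / ((m + 1) ** 2 * (m + 2))


def simulate(true_n, p1, p2_if_in_1, p2_if_not_in_1, reps, seed=0):
    """Toy Monte Carlo run (hypothetical design, not the thesis's).

    Each of true_n cases is captured by source 1 with probability p1.
    Source 2 captures a case with probability p2_if_in_1 when source 1
    caught it and p2_if_not_in_1 otherwise; equal values mean the
    sources are independent.
    Returns (relative bias, RMSE) of the Chapman estimate.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(reps):
        n1 = n2 = m = 0
        for _ in range(true_n):
            in_1 = rng.random() < p1
            p2 = p2_if_in_1 if in_1 else p2_if_not_in_1
            in_2 = rng.random() < p2
            n1 += in_1
            n2 += in_2
            m += in_1 and in_2
        estimates.append(chapman_estimate(n1, n2, m))
    mean_estimate = sum(estimates) / reps
    relative_bias = (mean_estimate - true_n) / true_n
    rmse = math.sqrt(sum((e - true_n) ** 2 for e in estimates) / reps)
    return relative_bias, rmse


if __name__ == "__main__":
    # Independent sources versus positively dependent sources.
    print("independent:", simulate(1000, 0.5, 0.5, 0.5, reps=100, seed=1))
    print("dependent:  ", simulate(1000, 0.5, 0.7, 0.3, reps=100, seed=1))
```

With these assumed settings, the dependent scenario should show a clearly negative relative bias, which is in line with the abstract's statement that population sizes were underestimated when the assumptions were violated. The sketch does not cover the MLRM estimator, coverage probability, or interval width.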
collection NDLTD
sources NDLTD
topic Capture-Recapture Models
Monte Carlo Simulation
Prevalence
Rheumatoid Arthritis
author2 Lix, Lisa (Community Health Sciences)
author Nie, Yao
title Completeness of rheumatoid arthritis prevalence estimates from administrative health data: comparison of capture-recapture models
publishDate 2014
url http://hdl.handle.net/1993/23679