Effects of incomplete inter-hospital network data on the assessment of transmission dynamics of hospital-acquired infections.

In the year 2020, there were 105 different statutory insurance companies in Germany with heterogeneous regional coverage. Obtaining data from all insurance companies is challenging, so that it is likely that projects will have to rely on data not covering the whole population. Consequently, the stud...

Full description

Bibliographic Details
Main Authors: Hanjue Xia, Johannes Horn, Monika J Piotrowska, Konrad Sakowski, André Karch, Hannan Tahir, Mirjam Kretzschmar, Rafael Mikolajczyk
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2021-05-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1008941
id doaj-b88a4bbec6824159aeab3ce21f2a7fb6
record_format Article
spelling doaj-b88a4bbec6824159aeab3ce21f2a7fb62021-05-30T04:32:00ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582021-05-01175e100894110.1371/journal.pcbi.1008941Effects of incomplete inter-hospital network data on the assessment of transmission dynamics of hospital-acquired infections.Hanjue XiaJohannes HornMonika J PiotrowskaKonrad SakowskiAndré KarchHannan TahirMirjam KretzschmarRafael MikolajczykIn the year 2020, there were 105 different statutory insurance companies in Germany with heterogeneous regional coverage. Obtaining data from all insurance companies is challenging, so that it is likely that projects will have to rely on data not covering the whole population. Consequently, the study of epidemic spread in hospital referral networks using data-driven models may be biased. We studied this bias using data from three German regional insurance companies covering four federal states: AOK (historically "general local health insurance company", but currently only the abbreviation is used) Lower Saxony (in Federal State of Lower Saxony), AOK Bavaria (in Bavaria), and AOK PLUS (in Thuringia and Saxony). To understand how incomplete data influence network characteristics and related epidemic simulations, we created sampled datasets by randomly dropping a proportion of patients from the full datasets and replacing them with random copies of the remaining patients to obtain scale-up datasets to the original size. For the sampled and scale-up datasets, we calculated several commonly used network measures, and compared them to those derived from the original data. We found that the network measures (degree, strength and closeness) were rather sensitive to incompleteness. Infection prevalence as an outcome from the applied susceptible-infectious-susceptible (SIS) model was fairly robust against incompleteness. At incompleteness levels as high as 90% of the original datasets the prevalence estimation bias was below 5% in scale-up datasets. Consequently, a coverage as low as 10% of the local population of the federal state population was sufficient to maintain the relative bias in prevalence below 10% for a wide range of transmission parameters as encountered in clinical settings. Our findings are reassuring that despite incomplete coverage of the population, German health insurance data can be used to study effects of patient traffic between institutions on the spread of pathogens within healthcare networks.https://doi.org/10.1371/journal.pcbi.1008941
collection DOAJ
language English
format Article
sources DOAJ
author Hanjue Xia
Johannes Horn
Monika J Piotrowska
Konrad Sakowski
André Karch
Hannan Tahir
Mirjam Kretzschmar
Rafael Mikolajczyk
spellingShingle Hanjue Xia
Johannes Horn
Monika J Piotrowska
Konrad Sakowski
André Karch
Hannan Tahir
Mirjam Kretzschmar
Rafael Mikolajczyk
Effects of incomplete inter-hospital network data on the assessment of transmission dynamics of hospital-acquired infections.
PLoS Computational Biology
author_facet Hanjue Xia
Johannes Horn
Monika J Piotrowska
Konrad Sakowski
André Karch
Hannan Tahir
Mirjam Kretzschmar
Rafael Mikolajczyk
author_sort Hanjue Xia
title Effects of incomplete inter-hospital network data on the assessment of transmission dynamics of hospital-acquired infections.
title_short Effects of incomplete inter-hospital network data on the assessment of transmission dynamics of hospital-acquired infections.
title_full Effects of incomplete inter-hospital network data on the assessment of transmission dynamics of hospital-acquired infections.
title_fullStr Effects of incomplete inter-hospital network data on the assessment of transmission dynamics of hospital-acquired infections.
title_full_unstemmed Effects of incomplete inter-hospital network data on the assessment of transmission dynamics of hospital-acquired infections.
title_sort effects of incomplete inter-hospital network data on the assessment of transmission dynamics of hospital-acquired infections.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2021-05-01
description In the year 2020, there were 105 different statutory insurance companies in Germany with heterogeneous regional coverage. Obtaining data from all insurance companies is challenging, so that it is likely that projects will have to rely on data not covering the whole population. Consequently, the study of epidemic spread in hospital referral networks using data-driven models may be biased. We studied this bias using data from three German regional insurance companies covering four federal states: AOK (historically "general local health insurance company", but currently only the abbreviation is used) Lower Saxony (in Federal State of Lower Saxony), AOK Bavaria (in Bavaria), and AOK PLUS (in Thuringia and Saxony). To understand how incomplete data influence network characteristics and related epidemic simulations, we created sampled datasets by randomly dropping a proportion of patients from the full datasets and replacing them with random copies of the remaining patients to obtain scale-up datasets to the original size. For the sampled and scale-up datasets, we calculated several commonly used network measures, and compared them to those derived from the original data. We found that the network measures (degree, strength and closeness) were rather sensitive to incompleteness. Infection prevalence as an outcome from the applied susceptible-infectious-susceptible (SIS) model was fairly robust against incompleteness. At incompleteness levels as high as 90% of the original datasets the prevalence estimation bias was below 5% in scale-up datasets. Consequently, a coverage as low as 10% of the local population of the federal state population was sufficient to maintain the relative bias in prevalence below 10% for a wide range of transmission parameters as encountered in clinical settings. Our findings are reassuring that despite incomplete coverage of the population, German health insurance data can be used to study effects of patient traffic between institutions on the spread of pathogens within healthcare networks.
url https://doi.org/10.1371/journal.pcbi.1008941
work_keys_str_mv AT hanjuexia effectsofincompleteinterhospitalnetworkdataontheassessmentoftransmissiondynamicsofhospitalacquiredinfections
AT johanneshorn effectsofincompleteinterhospitalnetworkdataontheassessmentoftransmissiondynamicsofhospitalacquiredinfections
AT monikajpiotrowska effectsofincompleteinterhospitalnetworkdataontheassessmentoftransmissiondynamicsofhospitalacquiredinfections
AT konradsakowski effectsofincompleteinterhospitalnetworkdataontheassessmentoftransmissiondynamicsofhospitalacquiredinfections
AT andrekarch effectsofincompleteinterhospitalnetworkdataontheassessmentoftransmissiondynamicsofhospitalacquiredinfections
AT hannantahir effectsofincompleteinterhospitalnetworkdataontheassessmentoftransmissiondynamicsofhospitalacquiredinfections
AT mirjamkretzschmar effectsofincompleteinterhospitalnetworkdataontheassessmentoftransmissiondynamicsofhospitalacquiredinfections
AT rafaelmikolajczyk effectsofincompleteinterhospitalnetworkdataontheassessmentoftransmissiondynamicsofhospitalacquiredinfections
_version_ 1721421119087443968