Learning to prevent healthcare-associated infections : leveraging data across time and space to improve local predictions

Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2014. === 148 === Cataloged from PDF version of thesis. === Includes bibliographical references (pages 136-149). === The proliferation of electronic medical records holds out the promise...

Full description

Bibliographic Details
Main Author: Wiens, Jenna Marleau
Other Authors: John Guttag.
Format: Others
Language:English
Published: Massachusetts Institute of Technology 2014
Subjects:
Online Access:http://hdl.handle.net/1721.1/91103
id ndltd-MIT-oai-dspace.mit.edu-1721.1-91103
record_format oai_dc
spelling ndltd-MIT-oai-dspace.mit.edu-1721.1-911032019-05-02T16:23:51Z Learning to prevent healthcare-associated infections : leveraging data across time and space to improve local predictions Wiens, Jenna Marleau John Guttag. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2014. 148 Cataloged from PDF version of thesis. Includes bibliographical references (pages 136-149). The proliferation of electronic medical records holds out the promise of using machine learning and data mining to build models that will help healthcare providers improve patient outcomes. However, building useful models from these datasets presents many technical problems. Among the challenges are the large number of factors (both intrinsic and extrinsic) influencing a patient's risk of an adverse outcome, the inherent evolution of that risk over time, and the relative rarity of adverse outcomes, institutional differences and the lack of ground truth. In this thesis we tackle these challenges in the context of predicting healthcare-associated infections (HAIs). HAIs are a serious problem in US acute care hospitals, affecting approximately 4% of all inpatients on any given day. Despite best efforts to reduce incidence, HAIs remain stubbornly prevalent. We hypothesize that one of the reasons why is lack of an effective clinical tool for accurately measuring patient risk. Therefore, we develop accurate models for predicting which patients are at risk of acquiring an infection with Clostridium difficile (a common HAI). In contrast to previous work, we take a novel data-centric approach, leveraging the contents of EMRs from over 100,000 hospital admissions. We show how, by adapting techniques from time-series classification, transfer learning and multitask learning, we can learn more accurate models for patient risk stratification. Our model, based on thousands of variables both time-varying and time-invariant, does not remain static but changes over the course of a patient admission. Applied to a held-out validation set of 25,000 patient admissions. our model achieved an area under the receiver operating characteristic curve of 0.81 (95% CI 0.78-0.84). The model has been successfully integrated into the health record system at a large hospital in the US, and is being used to produce daily risk estimates for each inpatient. While more complex than traditional risk stratification methods, the widespread development and use of such data-driven models could ultimately enable cost-effective, targeted prevention strategies that reduce the incidence of HAIs. by Jenna Wiens. Ph. D. 2014-10-21T17:26:14Z 2014-10-21T17:26:14Z 2014 2014 Thesis http://hdl.handle.net/1721.1/91103 892749892 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 149 pages application/pdf Massachusetts Institute of Technology
collection NDLTD
language English
format Others
sources NDLTD
topic Electrical Engineering and Computer Science.
spellingShingle Electrical Engineering and Computer Science.
Wiens, Jenna Marleau
Learning to prevent healthcare-associated infections : leveraging data across time and space to improve local predictions
description Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2014. === 148 === Cataloged from PDF version of thesis. === Includes bibliographical references (pages 136-149). === The proliferation of electronic medical records holds out the promise of using machine learning and data mining to build models that will help healthcare providers improve patient outcomes. However, building useful models from these datasets presents many technical problems. Among the challenges are the large number of factors (both intrinsic and extrinsic) influencing a patient's risk of an adverse outcome, the inherent evolution of that risk over time, and the relative rarity of adverse outcomes, institutional differences and the lack of ground truth. In this thesis we tackle these challenges in the context of predicting healthcare-associated infections (HAIs). HAIs are a serious problem in US acute care hospitals, affecting approximately 4% of all inpatients on any given day. Despite best efforts to reduce incidence, HAIs remain stubbornly prevalent. We hypothesize that one of the reasons why is lack of an effective clinical tool for accurately measuring patient risk. Therefore, we develop accurate models for predicting which patients are at risk of acquiring an infection with Clostridium difficile (a common HAI). In contrast to previous work, we take a novel data-centric approach, leveraging the contents of EMRs from over 100,000 hospital admissions. We show how, by adapting techniques from time-series classification, transfer learning and multitask learning, we can learn more accurate models for patient risk stratification. Our model, based on thousands of variables both time-varying and time-invariant, does not remain static but changes over the course of a patient admission. Applied to a held-out validation set of 25,000 patient admissions. our model achieved an area under the receiver operating characteristic curve of 0.81 (95% CI 0.78-0.84). The model has been successfully integrated into the health record system at a large hospital in the US, and is being used to produce daily risk estimates for each inpatient. While more complex than traditional risk stratification methods, the widespread development and use of such data-driven models could ultimately enable cost-effective, targeted prevention strategies that reduce the incidence of HAIs. === by Jenna Wiens. === Ph. D.
author2 John Guttag.
author_facet John Guttag.
Wiens, Jenna Marleau
author Wiens, Jenna Marleau
author_sort Wiens, Jenna Marleau
title Learning to prevent healthcare-associated infections : leveraging data across time and space to improve local predictions
title_short Learning to prevent healthcare-associated infections : leveraging data across time and space to improve local predictions
title_full Learning to prevent healthcare-associated infections : leveraging data across time and space to improve local predictions
title_fullStr Learning to prevent healthcare-associated infections : leveraging data across time and space to improve local predictions
title_full_unstemmed Learning to prevent healthcare-associated infections : leveraging data across time and space to improve local predictions
title_sort learning to prevent healthcare-associated infections : leveraging data across time and space to improve local predictions
publisher Massachusetts Institute of Technology
publishDate 2014
url http://hdl.handle.net/1721.1/91103
work_keys_str_mv AT wiensjennamarleau learningtopreventhealthcareassociatedinfectionsleveragingdataacrosstimeandspacetoimprovelocalpredictions
_version_ 1719040124794175488