Development of Phenotyping Algorithms for the Identification of Organ Transplant Recipients: Cohort Study

Background: Studies involving organ transplant recipients (OTRs) are often limited to the variables collected in the national Scientific Registry of Transplant Recipients database. Electronic health records contain additional variables that can augment this data source if OTRs...

Bibliographic Details
Main Authors:	Wheless, Lee; Baker, Laura; Edwards, LaVar; Anand, Nimay; Birdwell, Kelly; Hanlon, Allison; Chren, Mary-Margaret
Format:	Article
Language:	English
Published:	JMIR Publications 2020-12-01
Series:	JMIR Medical Informatics
Online Access:	http://medinform.jmir.org/2020/12/e18001/

id	doaj-54e88b3b16ff4a48880249f4b7b93edd
record_format	Article
spelling	doaj-54e88b3b16ff4a48880249f4b7b93edd 2021-05-03T02:53:23Z eng JMIR Publications JMIR Medical Informatics 2291-9694 2020-12-01 8 12 e18001 10.2196/18001 Development of Phenotyping Algorithms for the Identification of Organ Transplant Recipients: Cohort Study Wheless, Lee; Baker, Laura; Edwards, LaVar; Anand, Nimay; Birdwell, Kelly; Hanlon, Allison; Chren, Mary-Margaret http://medinform.jmir.org/2020/12/e18001/
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Wheless, Lee; Baker, Laura; Edwards, LaVar; Anand, Nimay; Birdwell, Kelly; Hanlon, Allison; Chren, Mary-Margaret
spellingShingle	Wheless, Lee; Baker, Laura; Edwards, LaVar; Anand, Nimay; Birdwell, Kelly; Hanlon, Allison; Chren, Mary-Margaret; Development of Phenotyping Algorithms for the Identification of Organ Transplant Recipients: Cohort Study; JMIR Medical Informatics
author_facet	Wheless, Lee; Baker, Laura; Edwards, LaVar; Anand, Nimay; Birdwell, Kelly; Hanlon, Allison; Chren, Mary-Margaret
author_sort	Wheless, Lee
title	Development of Phenotyping Algorithms for the Identification of Organ Transplant Recipients: Cohort Study
publisher	JMIR Publications
series	JMIR Medical Informatics
issn	2291-9694
publishDate	2020-12-01
description	Background: Studies involving organ transplant recipients (OTRs) are often limited to the variables collected in the national Scientific Registry of Transplant Recipients database. Electronic health records contain additional variables that can augment this data source if OTRs can be identified accurately. Objective: The aim of this study was to develop phenotyping algorithms to identify OTRs from electronic health records. Methods: We used Vanderbilt’s deidentified version of its electronic health record database, which contains nearly 3 million subjects, to develop algorithms to identify OTRs. We identified all 19,817 individuals with at least one International Classification of Diseases (ICD) or Current Procedural Terminology (CPT) code for organ transplantation. We performed a chart review on 1350 randomly selected individuals to determine the transplant status. We constructed machine learning models to calculate positive predictive values and sensitivity for combinations of codes by using classification and regression trees, random forest, and extreme gradient boosting algorithms. Results: Of the 1350 reviewed patient charts, 827 were organ transplant recipients while 511 had no record of a transplant, and 12 were equivocal. Most patients with only 1 or 2 transplant codes did not have a transplant. The most common reasons for being labeled a nontransplant patient were the lack of data (229/511, 44.8%) or the patient being evaluated for an organ transplant (174/511, 34.1%). All 3 machine learning algorithms identified OTRs with overall >90% positive predictive value and >88% sensitivity. Conclusions: Electronic health records linked to biobanks are increasingly used to conduct large-scale studies but have not been well-utilized in organ transplantation research. We present rigorously evaluated methods for phenotyping OTRs from electronic health records that will enable the use of the full spectrum of clinical data in transplant research. Using several different machine learning algorithms, we were able to identify transplant cases with high accuracy by using only ICD and CPT codes.
url	http://medinform.jmir.org/2020/12/e18001/
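
The abstract reports positive predictive value (PPV) and sensitivity for the phenotyping algorithms, along with several chart-review proportions. As a minimal sketch, assuming the standard definitions PPV = TP / (TP + FP) and sensitivity = TP / (TP + FN), the following Python snippet defines both metrics and rechecks the counts quoted in the abstract. The function names are hypothetical, and the toy confusion counts at the end are invented for illustration only; they are not results from the study.

```python
def ppv(tp: int, fp: int) -> float:
    """Positive predictive value: share of predicted positives that are true."""
    return tp / (tp + fp)


def sensitivity(tp: int, fn: int) -> float:
    """Sensitivity (recall): share of true positives that were detected."""
    return tp / (tp + fn)


def percent(numerator: int, denominator: int) -> float:
    """Proportion expressed as a percentage, rounded to one decimal place."""
    return round(100 * numerator / denominator, 1)


# Counts quoted in the abstract's Results section.
reviewed = 1350
recipients, non_recipients, equivocal = 827, 511, 12
counts_add_up = recipients + non_recipients + equivocal == reviewed

lack_of_data_pct = percent(229, non_recipients)   # abstract reports 44.8%
under_evaluation_pct = percent(174, non_recipients)  # abstract reports 34.1%

# Toy confusion counts (hypothetical, NOT from the study) to show the metrics.
toy_tp, toy_fp, toy_fn = 9, 1, 1
toy_ppv = ppv(toy_tp, toy_fp)
toy_sensitivity = sensitivity(toy_tp, toy_fn)

print(counts_add_up, lack_of_data_pct, under_evaluation_pct)
print(toy_ppv, toy_sensitivity)
```

The three chart-review categories sum to the 1350 reviewed charts, and the two recomputed percentages match the values given in the abstract.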

Similar Items