Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports.

<h4>Study aim</h4>To develop and apply a natural language processing algorithm for characterization of patients diagnosed with chronic pancreatitis in a diverse integrated U.S. healthcare system.<h4>Methods</h4>Retrospective cohort study including patients initially diagnosed...

Full description

Bibliographic Details
Main Authors: Fagen Xie, Qiaoling Chen, Yichen Zhou, Wansu Chen, Jemianne Bautista, Emilie T Nguyen, Rex A Parker, Bechien U Wu
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0236817
id doaj-aeec2db61cb140dd8466438b1fe09827
record_format Article
spelling doaj-aeec2db61cb140dd8466438b1fe098272021-03-04T11:15:00ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01158e023681710.1371/journal.pone.0236817Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports.Fagen XieQiaoling ChenYichen ZhouWansu ChenJemianne BautistaEmilie T NguyenRex A ParkerBechien U Wu<h4>Study aim</h4>To develop and apply a natural language processing algorithm for characterization of patients diagnosed with chronic pancreatitis in a diverse integrated U.S. healthcare system.<h4>Methods</h4>Retrospective cohort study including patients initially diagnosed with chronic pancreatitis (CP) within a regional integrated healthcare system between January 1, 2006 and December 31, 2015. Imaging reports from these patients were extracted from the electronic medical record system and split into training, validation and implementation datasets. A natural language processing (NLP) algorithm was first developed through the training dataset to identify specific features (atrophy, calcification, pseudocyst, cyst and main duct dilatation) from free-text radiology reports. The validation dataset was applied to validate the performance by comparing against the manual chart review. The developed algorithm was then applied to the implementation dataset. We classified patients with calcification(s) or ≥2 radiographic features as advanced CP. We compared etiology, comorbid conditions, treatment parameters as well as survival between advanced CP and others diagnosed during the study period.<h4>Results</h4>6,346 patients were diagnosed with CP during the study period with 58,085 radiology studies performed. For individual features, NLP yielded sensitivity from 88.7% to 95.3%, specificity from 98.2% to 100.0%. A total of 3,672 patients met cohort inclusion criteria: 1,330 (36.2%) had evidence of advanced CP. Patients with advanced CP had increased frequency of smoking (57.8% vs. 43.0%), diabetes (47.6% vs. 35.9%) and underweight body mass index (6.6% vs. 3.6%), all p<0.001. Mortality from pancreatic cancer was higher in advanced CP (15.3/1,000 person-year vs. 2.8/1,000, p<0.001). Underweight BMI (HR 1.6, 95% CL 1.2, 2.1), smoking (HR 1.4, 95% CL 1.1, 1.7) and diabetes (HR 1.4, 95% CL 1.2, 1.6) were independent risk factors for mortality.<h4>Conclusion</h4>Patients with advanced CP experienced increased disease-related complications and pancreatic cancer-related mortality. Excess all-cause mortality was driven primarily by potentially modifiable risk factors including malnutrition, smoking and diabetes.https://doi.org/10.1371/journal.pone.0236817
collection DOAJ
language English
format Article
sources DOAJ
author Fagen Xie
Qiaoling Chen
Yichen Zhou
Wansu Chen
Jemianne Bautista
Emilie T Nguyen
Rex A Parker
Bechien U Wu
spellingShingle Fagen Xie
Qiaoling Chen
Yichen Zhou
Wansu Chen
Jemianne Bautista
Emilie T Nguyen
Rex A Parker
Bechien U Wu
Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports.
PLoS ONE
author_facet Fagen Xie
Qiaoling Chen
Yichen Zhou
Wansu Chen
Jemianne Bautista
Emilie T Nguyen
Rex A Parker
Bechien U Wu
author_sort Fagen Xie
title Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports.
title_short Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports.
title_full Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports.
title_fullStr Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports.
title_full_unstemmed Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports.
title_sort characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2020-01-01
description <h4>Study aim</h4>To develop and apply a natural language processing algorithm for characterization of patients diagnosed with chronic pancreatitis in a diverse integrated U.S. healthcare system.<h4>Methods</h4>Retrospective cohort study including patients initially diagnosed with chronic pancreatitis (CP) within a regional integrated healthcare system between January 1, 2006 and December 31, 2015. Imaging reports from these patients were extracted from the electronic medical record system and split into training, validation and implementation datasets. A natural language processing (NLP) algorithm was first developed through the training dataset to identify specific features (atrophy, calcification, pseudocyst, cyst and main duct dilatation) from free-text radiology reports. The validation dataset was applied to validate the performance by comparing against the manual chart review. The developed algorithm was then applied to the implementation dataset. We classified patients with calcification(s) or ≥2 radiographic features as advanced CP. We compared etiology, comorbid conditions, treatment parameters as well as survival between advanced CP and others diagnosed during the study period.<h4>Results</h4>6,346 patients were diagnosed with CP during the study period with 58,085 radiology studies performed. For individual features, NLP yielded sensitivity from 88.7% to 95.3%, specificity from 98.2% to 100.0%. A total of 3,672 patients met cohort inclusion criteria: 1,330 (36.2%) had evidence of advanced CP. Patients with advanced CP had increased frequency of smoking (57.8% vs. 43.0%), diabetes (47.6% vs. 35.9%) and underweight body mass index (6.6% vs. 3.6%), all p<0.001. Mortality from pancreatic cancer was higher in advanced CP (15.3/1,000 person-year vs. 2.8/1,000, p<0.001). Underweight BMI (HR 1.6, 95% CL 1.2, 2.1), smoking (HR 1.4, 95% CL 1.1, 1.7) and diabetes (HR 1.4, 95% CL 1.2, 1.6) were independent risk factors for mortality.<h4>Conclusion</h4>Patients with advanced CP experienced increased disease-related complications and pancreatic cancer-related mortality. Excess all-cause mortality was driven primarily by potentially modifiable risk factors including malnutrition, smoking and diabetes.
url https://doi.org/10.1371/journal.pone.0236817
work_keys_str_mv AT fagenxie characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports
AT qiaolingchen characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports
AT yichenzhou characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports
AT wansuchen characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports
AT jemiannebautista characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports
AT emilietnguyen characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports
AT rexaparker characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports
AT bechienuwu characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports
_version_ 1714804310516170752