Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports.
<h4>Study aim</h4>To develop and apply a natural language processing algorithm for characterization of patients diagnosed with chronic pancreatitis in a diverse integrated U.S. healthcare system.<h4>Methods</h4>Retrospective cohort study including patients initially diagnosed...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2020-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0236817 |
id |
doaj-aeec2db61cb140dd8466438b1fe09827 |
---|---|
record_format |
Article |
spelling |
doaj-aeec2db61cb140dd8466438b1fe098272021-03-04T11:15:00ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01158e023681710.1371/journal.pone.0236817Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports.Fagen XieQiaoling ChenYichen ZhouWansu ChenJemianne BautistaEmilie T NguyenRex A ParkerBechien U Wu<h4>Study aim</h4>To develop and apply a natural language processing algorithm for characterization of patients diagnosed with chronic pancreatitis in a diverse integrated U.S. healthcare system.<h4>Methods</h4>Retrospective cohort study including patients initially diagnosed with chronic pancreatitis (CP) within a regional integrated healthcare system between January 1, 2006 and December 31, 2015. Imaging reports from these patients were extracted from the electronic medical record system and split into training, validation and implementation datasets. A natural language processing (NLP) algorithm was first developed through the training dataset to identify specific features (atrophy, calcification, pseudocyst, cyst and main duct dilatation) from free-text radiology reports. The validation dataset was applied to validate the performance by comparing against the manual chart review. The developed algorithm was then applied to the implementation dataset. We classified patients with calcification(s) or ≥2 radiographic features as advanced CP. We compared etiology, comorbid conditions, treatment parameters as well as survival between advanced CP and others diagnosed during the study period.<h4>Results</h4>6,346 patients were diagnosed with CP during the study period with 58,085 radiology studies performed. For individual features, NLP yielded sensitivity from 88.7% to 95.3%, specificity from 98.2% to 100.0%. A total of 3,672 patients met cohort inclusion criteria: 1,330 (36.2%) had evidence of advanced CP. Patients with advanced CP had increased frequency of smoking (57.8% vs. 43.0%), diabetes (47.6% vs. 35.9%) and underweight body mass index (6.6% vs. 3.6%), all p<0.001. Mortality from pancreatic cancer was higher in advanced CP (15.3/1,000 person-year vs. 2.8/1,000, p<0.001). Underweight BMI (HR 1.6, 95% CL 1.2, 2.1), smoking (HR 1.4, 95% CL 1.1, 1.7) and diabetes (HR 1.4, 95% CL 1.2, 1.6) were independent risk factors for mortality.<h4>Conclusion</h4>Patients with advanced CP experienced increased disease-related complications and pancreatic cancer-related mortality. Excess all-cause mortality was driven primarily by potentially modifiable risk factors including malnutrition, smoking and diabetes.https://doi.org/10.1371/journal.pone.0236817 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Fagen Xie Qiaoling Chen Yichen Zhou Wansu Chen Jemianne Bautista Emilie T Nguyen Rex A Parker Bechien U Wu |
spellingShingle |
Fagen Xie Qiaoling Chen Yichen Zhou Wansu Chen Jemianne Bautista Emilie T Nguyen Rex A Parker Bechien U Wu Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports. PLoS ONE |
author_facet |
Fagen Xie Qiaoling Chen Yichen Zhou Wansu Chen Jemianne Bautista Emilie T Nguyen Rex A Parker Bechien U Wu |
author_sort |
Fagen Xie |
title |
Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports. |
title_short |
Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports. |
title_full |
Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports. |
title_fullStr |
Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports. |
title_full_unstemmed |
Characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports. |
title_sort |
characterization of patients with advanced chronic pancreatitis using natural language processing of radiology reports. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2020-01-01 |
description |
<h4>Study aim</h4>To develop and apply a natural language processing algorithm for characterization of patients diagnosed with chronic pancreatitis in a diverse integrated U.S. healthcare system.<h4>Methods</h4>Retrospective cohort study including patients initially diagnosed with chronic pancreatitis (CP) within a regional integrated healthcare system between January 1, 2006 and December 31, 2015. Imaging reports from these patients were extracted from the electronic medical record system and split into training, validation and implementation datasets. A natural language processing (NLP) algorithm was first developed through the training dataset to identify specific features (atrophy, calcification, pseudocyst, cyst and main duct dilatation) from free-text radiology reports. The validation dataset was applied to validate the performance by comparing against the manual chart review. The developed algorithm was then applied to the implementation dataset. We classified patients with calcification(s) or ≥2 radiographic features as advanced CP. We compared etiology, comorbid conditions, treatment parameters as well as survival between advanced CP and others diagnosed during the study period.<h4>Results</h4>6,346 patients were diagnosed with CP during the study period with 58,085 radiology studies performed. For individual features, NLP yielded sensitivity from 88.7% to 95.3%, specificity from 98.2% to 100.0%. A total of 3,672 patients met cohort inclusion criteria: 1,330 (36.2%) had evidence of advanced CP. Patients with advanced CP had increased frequency of smoking (57.8% vs. 43.0%), diabetes (47.6% vs. 35.9%) and underweight body mass index (6.6% vs. 3.6%), all p<0.001. Mortality from pancreatic cancer was higher in advanced CP (15.3/1,000 person-year vs. 2.8/1,000, p<0.001). Underweight BMI (HR 1.6, 95% CL 1.2, 2.1), smoking (HR 1.4, 95% CL 1.1, 1.7) and diabetes (HR 1.4, 95% CL 1.2, 1.6) were independent risk factors for mortality.<h4>Conclusion</h4>Patients with advanced CP experienced increased disease-related complications and pancreatic cancer-related mortality. Excess all-cause mortality was driven primarily by potentially modifiable risk factors including malnutrition, smoking and diabetes. |
url |
https://doi.org/10.1371/journal.pone.0236817 |
work_keys_str_mv |
AT fagenxie characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports AT qiaolingchen characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports AT yichenzhou characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports AT wansuchen characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports AT jemiannebautista characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports AT emilietnguyen characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports AT rexaparker characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports AT bechienuwu characterizationofpatientswithadvancedchronicpancreatitisusingnaturallanguageprocessingofradiologyreports |
_version_ |
1714804310516170752 |