Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework

BackgroundLiver cancer is a substantial disease burden in China. As one of the primary diagnostic tools for detecting liver cancer, dynamic contrast-enhanced computed tomography provides detailed evidences for diagnosis that are recorded in free-text radiology reports....

Full description

Bibliographic Details
Main Authors: Liu, Honglei, Zhang, Zhiqiang, Xu, Yan, Wang, Ni, Huang, Yanqun, Yang, Zhenghan, Jiang, Rui, Chen, Hui
Format: Article
Language:English
Published: JMIR Publications 2021-01-01
Series:Journal of Medical Internet Research
Online Access:http://www.jmir.org/2021/1/e19689/
id doaj-79f6f5371e744495aa6c951ebecba71d
record_format Article
spelling doaj-79f6f5371e744495aa6c951ebecba71d2021-04-02T19:21:32ZengJMIR PublicationsJournal of Medical Internet Research1438-88712021-01-01231e1968910.2196/19689Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis FrameworkLiu, HongleiZhang, ZhiqiangXu, YanWang, NiHuang, YanqunYang, ZhenghanJiang, RuiChen, Hui BackgroundLiver cancer is a substantial disease burden in China. As one of the primary diagnostic tools for detecting liver cancer, dynamic contrast-enhanced computed tomography provides detailed evidences for diagnosis that are recorded in free-text radiology reports. ObjectiveThe aim of our study was to apply a deep learning model and rule-based natural language processing (NLP) method to identify evidences for liver cancer diagnosis automatically. MethodsWe proposed a pretrained, fine-tuned BERT (Bidirectional Encoder Representations from Transformers)-based BiLSTM-CRF (Bidirectional Long Short-Term Memory-Conditional Random Field) model to recognize the phrases of APHE (hyperintense enhancement in the arterial phase) and PDPH (hypointense in the portal and delayed phases). To identify more essential diagnostic evidences, we used the traditional rule-based NLP methods for the extraction of radiological features. APHE, PDPH, and other extracted radiological features were used to design a computer-aided liver cancer diagnosis framework by random forest. ResultsThe BERT-BiLSTM-CRF predicted the phrases of APHE and PDPH with an F1 score of 98.40% and 90.67%, respectively. The prediction model using combined features had a higher performance (F1 score, 88.55%) than those using APHE and PDPH (84.88%) or other extracted radiological features (83.52%). APHE and PDPH were the top 2 essential features for liver cancer diagnosis. ConclusionsThis work was a comprehensive NLP study, wherein we identified evidences for the diagnosis of liver cancer from Chinese radiology reports, considering both clinical knowledge and radiology findings. The BERT-based deep learning method for the extraction of diagnostic evidence achieved state-of-the-art performance. The high performance proves the feasibility of the BERT-BiLSTM-CRF model in information extraction from Chinese radiology reports. The findings of our study suggest that the deep learning–based method for automatically identifying evidences for diagnosis can be extended to other types of Chinese clinical texts.http://www.jmir.org/2021/1/e19689/
collection DOAJ
language English
format Article
sources DOAJ
author Liu, Honglei
Zhang, Zhiqiang
Xu, Yan
Wang, Ni
Huang, Yanqun
Yang, Zhenghan
Jiang, Rui
Chen, Hui
spellingShingle Liu, Honglei
Zhang, Zhiqiang
Xu, Yan
Wang, Ni
Huang, Yanqun
Yang, Zhenghan
Jiang, Rui
Chen, Hui
Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework
Journal of Medical Internet Research
author_facet Liu, Honglei
Zhang, Zhiqiang
Xu, Yan
Wang, Ni
Huang, Yanqun
Yang, Zhenghan
Jiang, Rui
Chen, Hui
author_sort Liu, Honglei
title Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework
title_short Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework
title_full Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework
title_fullStr Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework
title_full_unstemmed Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework
title_sort use of bert (bidirectional encoder representations from transformers)-based deep learning method for extracting evidences in chinese radiology reports: development of a computer-aided liver cancer diagnosis framework
publisher JMIR Publications
series Journal of Medical Internet Research
issn 1438-8871
publishDate 2021-01-01
description BackgroundLiver cancer is a substantial disease burden in China. As one of the primary diagnostic tools for detecting liver cancer, dynamic contrast-enhanced computed tomography provides detailed evidences for diagnosis that are recorded in free-text radiology reports. ObjectiveThe aim of our study was to apply a deep learning model and rule-based natural language processing (NLP) method to identify evidences for liver cancer diagnosis automatically. MethodsWe proposed a pretrained, fine-tuned BERT (Bidirectional Encoder Representations from Transformers)-based BiLSTM-CRF (Bidirectional Long Short-Term Memory-Conditional Random Field) model to recognize the phrases of APHE (hyperintense enhancement in the arterial phase) and PDPH (hypointense in the portal and delayed phases). To identify more essential diagnostic evidences, we used the traditional rule-based NLP methods for the extraction of radiological features. APHE, PDPH, and other extracted radiological features were used to design a computer-aided liver cancer diagnosis framework by random forest. ResultsThe BERT-BiLSTM-CRF predicted the phrases of APHE and PDPH with an F1 score of 98.40% and 90.67%, respectively. The prediction model using combined features had a higher performance (F1 score, 88.55%) than those using APHE and PDPH (84.88%) or other extracted radiological features (83.52%). APHE and PDPH were the top 2 essential features for liver cancer diagnosis. ConclusionsThis work was a comprehensive NLP study, wherein we identified evidences for the diagnosis of liver cancer from Chinese radiology reports, considering both clinical knowledge and radiology findings. The BERT-based deep learning method for the extraction of diagnostic evidence achieved state-of-the-art performance. The high performance proves the feasibility of the BERT-BiLSTM-CRF model in information extraction from Chinese radiology reports. The findings of our study suggest that the deep learning–based method for automatically identifying evidences for diagnosis can be extended to other types of Chinese clinical texts.
url http://www.jmir.org/2021/1/e19689/
work_keys_str_mv AT liuhonglei useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
AT zhangzhiqiang useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
AT xuyan useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
AT wangni useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
AT huangyanqun useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
AT yangzhenghan useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
AT jiangrui useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
AT chenhui useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
_version_ 1721548939544494080