A Novel Deep Similarity Learning Approach to Electronic Health Records Data
The past decade has seen a tremendous advancement in using Electronic Health Records (EHRs) to offer clinical decision support and provide personalized healthcare to patients. Despite the potential benefits offered by EHR data, it is challenging to represent and analyze large EHRs for predictive mod...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9257424/ |
id |
doaj-254c84759eff46569b67508341956998 |
---|---|
record_format |
Article |
spelling |
doaj-254c84759eff46569b675083419569982021-03-30T03:35:03ZengIEEEIEEE Access2169-35362020-01-01820927820929510.1109/ACCESS.2020.30377109257424A Novel Deep Similarity Learning Approach to Electronic Health Records DataVagisha Gupta0Shelly Sachdeva1https://orcid.org/0000-0003-4088-1271Subhash Bhalla2Department of Computer Science and Engineering, National Institute of Technology Delhi (NITD), Delhi, IndiaDepartment of Computer Science and Engineering, National Institute of Technology Delhi (NITD), Delhi, IndiaDepartment of Computer Science and Engineering, The University of Aizu, Aizuwakamatsu, JapanThe past decade has seen a tremendous advancement in using Electronic Health Records (EHRs) to offer clinical decision support and provide personalized healthcare to patients. Despite the potential benefits offered by EHR data, it is challenging to represent and analyze large EHRs for predictive modeling due to heterogeneity, high dimensionality, and sparsity. This article proposes a novel supervised Deep Similarity Learning approach that learns the patient representations and also finds the relationship between the patients using pairwise similarity learning to facilitate predictive analysis for personalized healthcare. We develop CNN_Softmax which is a Siamese-based neural network for multi-class classification methods corresponding to the prediction of disease. It uses Convolutional Neural Network (CNN) to study the vector representation of raw EHRs and capture essential information of patient features, and a Softmax-based supervised classification method that learns the similarity between pairs of patients and performs disease prediction using this similarity information. Our approach uses data type mapping to handle heterogeneity and the polynomial interpolation method to handle sparsity existing in EHR data. ORBDA, which is an openEHR (standard) benchmark dataset, is used for evaluating this study. Experimental results show that CNN_Softmax achieves an accuracy of 97.8%, a recall of 98.1%, a precision of 96.02%, and an F1 score of 97.82%. The comparative results show that our proposed novel methodology performs disease prediction with highly promising results and outperforms state-of-the-art similarity learning methods. The current study is the first attempt to perform disease prediction on standardized EHRs, to the best of the authors' knowledge. The deep similarity learning approach provides support for clinical decision making that is more reliable and generalizable than previous approaches and focuses on dealing with heterogeneous and sparse data. The concept also serves as a new implementation of artificial intelligence technologies for the application of clinical big data.https://ieeexplore.ieee.org/document/9257424/Convolutional neural networksdeep learningelectronic health recordsnephrologysimilarity learningsoftmax-based technique |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Vagisha Gupta Shelly Sachdeva Subhash Bhalla |
spellingShingle |
Vagisha Gupta Shelly Sachdeva Subhash Bhalla A Novel Deep Similarity Learning Approach to Electronic Health Records Data IEEE Access Convolutional neural networks deep learning electronic health records nephrology similarity learning softmax-based technique |
author_facet |
Vagisha Gupta Shelly Sachdeva Subhash Bhalla |
author_sort |
Vagisha Gupta |
title |
A Novel Deep Similarity Learning Approach to Electronic Health Records Data |
title_short |
A Novel Deep Similarity Learning Approach to Electronic Health Records Data |
title_full |
A Novel Deep Similarity Learning Approach to Electronic Health Records Data |
title_fullStr |
A Novel Deep Similarity Learning Approach to Electronic Health Records Data |
title_full_unstemmed |
A Novel Deep Similarity Learning Approach to Electronic Health Records Data |
title_sort |
novel deep similarity learning approach to electronic health records data |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
The past decade has seen a tremendous advancement in using Electronic Health Records (EHRs) to offer clinical decision support and provide personalized healthcare to patients. Despite the potential benefits offered by EHR data, it is challenging to represent and analyze large EHRs for predictive modeling due to heterogeneity, high dimensionality, and sparsity. This article proposes a novel supervised Deep Similarity Learning approach that learns the patient representations and also finds the relationship between the patients using pairwise similarity learning to facilitate predictive analysis for personalized healthcare. We develop CNN_Softmax which is a Siamese-based neural network for multi-class classification methods corresponding to the prediction of disease. It uses Convolutional Neural Network (CNN) to study the vector representation of raw EHRs and capture essential information of patient features, and a Softmax-based supervised classification method that learns the similarity between pairs of patients and performs disease prediction using this similarity information. Our approach uses data type mapping to handle heterogeneity and the polynomial interpolation method to handle sparsity existing in EHR data. ORBDA, which is an openEHR (standard) benchmark dataset, is used for evaluating this study. Experimental results show that CNN_Softmax achieves an accuracy of 97.8%, a recall of 98.1%, a precision of 96.02%, and an F1 score of 97.82%. The comparative results show that our proposed novel methodology performs disease prediction with highly promising results and outperforms state-of-the-art similarity learning methods. The current study is the first attempt to perform disease prediction on standardized EHRs, to the best of the authors' knowledge. The deep similarity learning approach provides support for clinical decision making that is more reliable and generalizable than previous approaches and focuses on dealing with heterogeneous and sparse data. The concept also serves as a new implementation of artificial intelligence technologies for the application of clinical big data. |
topic |
Convolutional neural networks deep learning electronic health records nephrology similarity learning softmax-based technique |
url |
https://ieeexplore.ieee.org/document/9257424/ |
work_keys_str_mv |
AT vagishagupta anoveldeepsimilaritylearningapproachtoelectronichealthrecordsdata AT shellysachdeva anoveldeepsimilaritylearningapproachtoelectronichealthrecordsdata AT subhashbhalla anoveldeepsimilaritylearningapproachtoelectronichealthrecordsdata AT vagishagupta noveldeepsimilaritylearningapproachtoelectronichealthrecordsdata AT shellysachdeva noveldeepsimilaritylearningapproachtoelectronichealthrecordsdata AT subhashbhalla noveldeepsimilaritylearningapproachtoelectronichealthrecordsdata |
_version_ |
1724183159672668160 |