A Novel Deep Similarity Learning Approach to Electronic Health Records Data

The past decade has seen tremendous advances in using Electronic Health Records (EHRs) to offer clinical decision support and provide personalized healthcare to patients. Despite the potential benefits of EHR data, it is challenging to represent and analyze large EHRs for predictive mod...

Bibliographic Details
Main Authors: Vagisha Gupta, Shelly Sachdeva, Subhash Bhalla
Format: Article
Language: English
Published: IEEE 2020-01-01
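Series: IEEE Access
Subjects: Convolutional neural networks, deep learning, electronic health records, nephrology, similarity learning, softmax-based technique
Online Access: https://ieeexplore.ieee.org/document/9257424/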
id doaj-254c84759eff46569b67508341956998
record_format Article
spelling id: doaj-254c84759eff46569b67508341956998
record timestamp: 2021-03-30T03:35:03Z
language: eng
publisher: IEEE
series: IEEE Access
issn: 2169-3536
published: 2020-01-01
volume: 8
pages: 209278-209295
doi: 10.1109/ACCESS.2020.3037710
IEEE Xplore document: 9257424
title: A Novel Deep Similarity Learning Approach to Electronic Health Records Data
authors:
Vagisha Gupta, Department of Computer Science and Engineering, National Institute of Technology Delhi (NITD), Delhi, India
Shelly Sachdeva (https://orcid.org/0000-0003-4088-1271), Department of Computer Science and Engineering, National Institute of Technology Delhi (NITD), Delhi, India
Subhash Bhalla, Department of Computer Science and Engineering, The University of Aizu, Aizuwakamatsu, Japan
collection DOAJ
language English
format Article
sources DOAJ
author Vagisha Gupta
Shelly Sachdeva
Subhash Bhalla
title A Novel Deep Similarity Learning Approach to Electronic Health Records Data
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description The past decade has seen tremendous advances in using Electronic Health Records (EHRs) to offer clinical decision support and provide personalized healthcare to patients. Despite the potential benefits of EHR data, it is challenging to represent and analyze large EHRs for predictive modeling because of their heterogeneity, high dimensionality, and sparsity. This article proposes a novel supervised Deep Similarity Learning approach that learns patient representations and uses pairwise similarity learning to find relationships between patients, facilitating predictive analysis for personalized healthcare. We develop CNN_Softmax, a Siamese-based neural network for multi-class classification that corresponds to disease prediction. It uses a Convolutional Neural Network (CNN) to learn vector representations of raw EHRs and capture the essential information in patient features, and a Softmax-based supervised classification method that learns the similarity between pairs of patients and uses this similarity information to predict disease. Our approach uses data type mapping to handle heterogeneity and polynomial interpolation to handle sparsity in EHR data. The study is evaluated on ORBDA, an openEHR (standard) benchmark dataset. Experimental results show that CNN_Softmax achieves an accuracy of 97.8%, a recall of 98.1%, a precision of 96.02%, and an F1 score of 97.82%. The comparative results show that the proposed methodology outperforms state-of-the-art similarity learning methods at disease prediction. To the best of the authors' knowledge, the current study is the first attempt to perform disease prediction on standardized EHRs. The deep similarity learning approach provides support for clinical decision making that is more reliable and generalizable than previous approaches and focuses on dealing with heterogeneous and sparse data. The concept also serves as a new implementation of artificial intelligence technologies for the application of clinical big data.
topic Convolutional neural networks
deep learning
electronic health records
nephrology
similarity learning
softmax-based technique
url https://ieeexplore.ieee.org/document/9257424/
_version_ 1724183159672668160
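
Note on the description: it names polynomial interpolation as the way sparsity in EHR data is handled, but gives no implementation details. As a rough illustration only, the Python sketch below fills gaps in a single numeric series by evaluating a low-degree Lagrange polynomial through the nearest observed points. The function names, the window of three points, and the sample readings are assumptions made for this example and are not taken from the article.

```python
from typing import List, Optional, Sequence, Tuple


def lagrange_eval(points: Sequence[Tuple[float, float]], x: float) -> float:
    """Evaluate the unique polynomial through `points` at `x` (Lagrange form)."""
    total = 0.0
    for i, (xi, yi) in enumerate(points):
        term = yi
        for j, (xj, _) in enumerate(points):
            if i != j:
                term *= (x - xj) / (xi - xj)
        total += term
    return total


def fill_missing(values: Sequence[Optional[float]], window: int = 3) -> List[float]:
    """Fill None entries with a polynomial through the `window` nearest known points.

    The window size is an assumption of this sketch; a small window keeps the
    polynomial degree low, which avoids the oscillation of high-degree fits.
    """
    known = [(float(i), float(v)) for i, v in enumerate(values) if v is not None]
    if not known:
        raise ValueError("at least one observed value is required")
    filled: List[float] = []
    for i, v in enumerate(values):
        if v is not None:
            filled.append(float(v))
            continue
        # Pick the observed points closest in position to the gap.
        nearest = sorted(known, key=lambda p: abs(p[0] - i))[:window]
        filled.append(lagrange_eval(nearest, float(i)))
    return filled


if __name__ == "__main__":
    # Hypothetical series of one numeric attribute across visits; None marks a gap.
    readings = [1.0, None, 9.0, 16.0, None, 36.0]
    print(fill_missing(readings))
```

Observed values pass through unchanged; only the gaps are estimated. Whether the article interpolates per attribute, per patient, or with a different polynomial degree is not stated in this record.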