Discovering associations between problem list and practice setting

Abstract Background The Health Information Technology for Economic and Clinical Health Act (HITECH) has greatly accelerated the adoption of electronic health records (EHRs) with the promise of better clinical decisions and patients’ outcomes. One of the core criteria for “Meaningful Use” of EHRs is...

Full description

Bibliographic Details
Main Authors: Liwei Wang, Yanshan Wang, Feichen Shen, Majid Rastegar-Mojarad, Hongfang Liu
Format: Article
Language:English
Published: BMC 2019-04-01
Series:BMC Medical Informatics and Decision Making
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12911-019-0779-y
id doaj-a048412f131c413d93434d4fb3a15544
record_format Article
spelling doaj-a048412f131c413d93434d4fb3a155442020-11-25T03:31:59ZengBMCBMC Medical Informatics and Decision Making1472-69472019-04-0119S3132210.1186/s12911-019-0779-yDiscovering associations between problem list and practice settingLiwei Wang0Yanshan Wang1Feichen Shen2Majid Rastegar-Mojarad3Hongfang Liu4Division of Digital Health Sciences, Department of Health Sciences Research, Mayo ClinicDivision of Digital Health Sciences, Department of Health Sciences Research, Mayo ClinicDivision of Digital Health Sciences, Department of Health Sciences Research, Mayo ClinicDivision of Digital Health Sciences, Department of Health Sciences Research, Mayo ClinicDivision of Digital Health Sciences, Department of Health Sciences Research, Mayo ClinicAbstract Background The Health Information Technology for Economic and Clinical Health Act (HITECH) has greatly accelerated the adoption of electronic health records (EHRs) with the promise of better clinical decisions and patients’ outcomes. One of the core criteria for “Meaningful Use” of EHRs is to have a problem list that shows the most important health problems faced by a patient. The implementation of problem lists in EHRs has a potential to help practitioners to provide customized care to patients. However, it remains an open question on how to leverage problem lists in different practice settings to provide tailored care, of which the bottleneck lies in the associations between problem list and practice setting. Methods In this study, using sampled clinical documents associated with a cohort of patients who received their primary care at Mayo Clinic, we investigated the associations between problem list and practice setting through natural language processing (NLP) and topic modeling techniques. Specifically, after practice settings and problem lists were normalized, statistical χ2 test, term frequency-inverse document frequency (TF-IDF) and enrichment analysis were used to choose representative concepts for each setting. Then Latent Dirichlet Allocations (LDA) were used to train topic models and predict potential practice settings using similarity metrics based on the problem concepts representative of practice settings. Evaluation was conducted through 5-fold cross validation and Recall@k, Precision@k and F1@k were calculated. Results Our method can generate prioritized and meaningful problem lists corresponding to specific practice settings. For practice setting prediction, recall increases from 0.719 (k = 2) to 0.931 (k = 10), precision increases from 0.882 (k = 2) to 0.931 (k = 10) and F1 increases from 0.790 (k = 2) to 0.931 (k = 10). Conclusion To our best knowledge, our study is the first attempting to discover the association between the problem lists and hospital practice settings. In the future, we plan to investigate how to provide more tailored care by utilizing the association between problem list and practice setting revealed in this study.http://link.springer.com/article/10.1186/s12911-019-0779-yProblem listPractice settingTopic modelingStatistical χ2 testTF-IDF and enrichment analysis
collection DOAJ
language English
format Article
sources DOAJ
author Liwei Wang
Yanshan Wang
Feichen Shen
Majid Rastegar-Mojarad
Hongfang Liu
spellingShingle Liwei Wang
Yanshan Wang
Feichen Shen
Majid Rastegar-Mojarad
Hongfang Liu
Discovering associations between problem list and practice setting
BMC Medical Informatics and Decision Making
Problem list
Practice setting
Topic modeling
Statistical χ2 test
TF-IDF and enrichment analysis
author_facet Liwei Wang
Yanshan Wang
Feichen Shen
Majid Rastegar-Mojarad
Hongfang Liu
author_sort Liwei Wang
title Discovering associations between problem list and practice setting
title_short Discovering associations between problem list and practice setting
title_full Discovering associations between problem list and practice setting
title_fullStr Discovering associations between problem list and practice setting
title_full_unstemmed Discovering associations between problem list and practice setting
title_sort discovering associations between problem list and practice setting
publisher BMC
series BMC Medical Informatics and Decision Making
issn 1472-6947
publishDate 2019-04-01
description Abstract Background The Health Information Technology for Economic and Clinical Health Act (HITECH) has greatly accelerated the adoption of electronic health records (EHRs) with the promise of better clinical decisions and patients’ outcomes. One of the core criteria for “Meaningful Use” of EHRs is to have a problem list that shows the most important health problems faced by a patient. The implementation of problem lists in EHRs has a potential to help practitioners to provide customized care to patients. However, it remains an open question on how to leverage problem lists in different practice settings to provide tailored care, of which the bottleneck lies in the associations between problem list and practice setting. Methods In this study, using sampled clinical documents associated with a cohort of patients who received their primary care at Mayo Clinic, we investigated the associations between problem list and practice setting through natural language processing (NLP) and topic modeling techniques. Specifically, after practice settings and problem lists were normalized, statistical χ2 test, term frequency-inverse document frequency (TF-IDF) and enrichment analysis were used to choose representative concepts for each setting. Then Latent Dirichlet Allocations (LDA) were used to train topic models and predict potential practice settings using similarity metrics based on the problem concepts representative of practice settings. Evaluation was conducted through 5-fold cross validation and Recall@k, Precision@k and F1@k were calculated. Results Our method can generate prioritized and meaningful problem lists corresponding to specific practice settings. For practice setting prediction, recall increases from 0.719 (k = 2) to 0.931 (k = 10), precision increases from 0.882 (k = 2) to 0.931 (k = 10) and F1 increases from 0.790 (k = 2) to 0.931 (k = 10). Conclusion To our best knowledge, our study is the first attempting to discover the association between the problem lists and hospital practice settings. In the future, we plan to investigate how to provide more tailored care by utilizing the association between problem list and practice setting revealed in this study.
topic Problem list
Practice setting
Topic modeling
Statistical χ2 test
TF-IDF and enrichment analysis
url http://link.springer.com/article/10.1186/s12911-019-0779-y
work_keys_str_mv AT liweiwang discoveringassociationsbetweenproblemlistandpracticesetting
AT yanshanwang discoveringassociationsbetweenproblemlistandpracticesetting
AT feichenshen discoveringassociationsbetweenproblemlistandpracticesetting
AT majidrastegarmojarad discoveringassociationsbetweenproblemlistandpracticesetting
AT hongfangliu discoveringassociationsbetweenproblemlistandpracticesetting
_version_ 1724570422901473280