Medical Social Media Text Classification Integrating Consumer Health Terminology

In recent years, advances in technologies, such as machine learning, natural language processing, and automated data processing, have offered potential biomedical and public health applications that use massive data sources, e.g., social media. However, current methods are underutilized for features...

Full description

Bibliographic Details
Main Authors: Kan Liu, Lu Chen
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8733799/
id doaj-107e6bd4f2954bf887d4fdfd95c0f718
record_format Article
spelling doaj-107e6bd4f2954bf887d4fdfd95c0f7182021-03-29T23:26:28ZengIEEEIEEE Access2169-35362019-01-017781857819310.1109/ACCESS.2019.29219388733799Medical Social Media Text Classification Integrating Consumer Health TerminologyKan Liu0https://orcid.org/0000-0002-9686-9768Lu Chen1School of Information and Safety Engineering, Zhongnan University of Economics and Law, Wuhan, ChinaSchool of Information and Safety Engineering, Zhongnan University of Economics and Law, Wuhan, ChinaIn recent years, advances in technologies, such as machine learning, natural language processing, and automated data processing, have offered potential biomedical and public health applications that use massive data sources, e.g., social media. However, current methods are underutilized for features including consumer health terminology in social media texts. In this paper, we proposed a medical social media text classification (MSMTC) algorithm that integrates consumer health terminology. Classification of text from social media on medical subjects is divided into two sub-tasks: consumer health terminology extraction and text classification. First, text characteristics based on the double channel structure are used for training, and consumer health terminology is subsequently extracted-based using an adversarial network. Then, text classification is implemented based on the extracted consumer health terminology and double channel subtraction method. This paper takes datasets containing patient descriptions from social media as an example. The experimental results show that the algorithm outperforms single channel methods or baseline models, including Convolutional Neural Networks, Long Short-Term Memory Networks, Bi-directional Long Short-Term Memory Networks, Naive Bayesian Model, and Extreme Gradient Boosting.https://ieeexplore.ieee.org/document/8733799/Adversarial networkdouble channel structuremedical social media text classification (MSMTC)terminology
collection DOAJ
language English
format Article
sources DOAJ
author Kan Liu
Lu Chen
spellingShingle Kan Liu
Lu Chen
Medical Social Media Text Classification Integrating Consumer Health Terminology
IEEE Access
Adversarial network
double channel structure
medical social media text classification (MSMTC)
terminology
author_facet Kan Liu
Lu Chen
author_sort Kan Liu
title Medical Social Media Text Classification Integrating Consumer Health Terminology
title_short Medical Social Media Text Classification Integrating Consumer Health Terminology
title_full Medical Social Media Text Classification Integrating Consumer Health Terminology
title_fullStr Medical Social Media Text Classification Integrating Consumer Health Terminology
title_full_unstemmed Medical Social Media Text Classification Integrating Consumer Health Terminology
title_sort medical social media text classification integrating consumer health terminology
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description In recent years, advances in technologies, such as machine learning, natural language processing, and automated data processing, have offered potential biomedical and public health applications that use massive data sources, e.g., social media. However, current methods are underutilized for features including consumer health terminology in social media texts. In this paper, we proposed a medical social media text classification (MSMTC) algorithm that integrates consumer health terminology. Classification of text from social media on medical subjects is divided into two sub-tasks: consumer health terminology extraction and text classification. First, text characteristics based on the double channel structure are used for training, and consumer health terminology is subsequently extracted-based using an adversarial network. Then, text classification is implemented based on the extracted consumer health terminology and double channel subtraction method. This paper takes datasets containing patient descriptions from social media as an example. The experimental results show that the algorithm outperforms single channel methods or baseline models, including Convolutional Neural Networks, Long Short-Term Memory Networks, Bi-directional Long Short-Term Memory Networks, Naive Bayesian Model, and Extreme Gradient Boosting.
topic Adversarial network
double channel structure
medical social media text classification (MSMTC)
terminology
url https://ieeexplore.ieee.org/document/8733799/
work_keys_str_mv AT kanliu medicalsocialmediatextclassificationintegratingconsumerhealthterminology
AT luchen medicalsocialmediatextclassificationintegratingconsumerhealthterminology
_version_ 1724189507508502528