Medical Social Media Text Classification Integrating Consumer Health Terminology
In recent years, advances in technologies, such as machine learning, natural language processing, and automated data processing, have offered potential biomedical and public health applications that use massive data sources, e.g., social media. However, current methods are underutilized for features...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8733799/ |
id |
doaj-107e6bd4f2954bf887d4fdfd95c0f718 |
---|---|
record_format |
Article |
spelling |
doaj-107e6bd4f2954bf887d4fdfd95c0f7182021-03-29T23:26:28ZengIEEEIEEE Access2169-35362019-01-017781857819310.1109/ACCESS.2019.29219388733799Medical Social Media Text Classification Integrating Consumer Health TerminologyKan Liu0https://orcid.org/0000-0002-9686-9768Lu Chen1School of Information and Safety Engineering, Zhongnan University of Economics and Law, Wuhan, ChinaSchool of Information and Safety Engineering, Zhongnan University of Economics and Law, Wuhan, ChinaIn recent years, advances in technologies, such as machine learning, natural language processing, and automated data processing, have offered potential biomedical and public health applications that use massive data sources, e.g., social media. However, current methods are underutilized for features including consumer health terminology in social media texts. In this paper, we proposed a medical social media text classification (MSMTC) algorithm that integrates consumer health terminology. Classification of text from social media on medical subjects is divided into two sub-tasks: consumer health terminology extraction and text classification. First, text characteristics based on the double channel structure are used for training, and consumer health terminology is subsequently extracted-based using an adversarial network. Then, text classification is implemented based on the extracted consumer health terminology and double channel subtraction method. This paper takes datasets containing patient descriptions from social media as an example. The experimental results show that the algorithm outperforms single channel methods or baseline models, including Convolutional Neural Networks, Long Short-Term Memory Networks, Bi-directional Long Short-Term Memory Networks, Naive Bayesian Model, and Extreme Gradient Boosting.https://ieeexplore.ieee.org/document/8733799/Adversarial networkdouble channel structuremedical social media text classification (MSMTC)terminology |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Kan Liu Lu Chen |
spellingShingle |
Kan Liu Lu Chen Medical Social Media Text Classification Integrating Consumer Health Terminology IEEE Access Adversarial network double channel structure medical social media text classification (MSMTC) terminology |
author_facet |
Kan Liu Lu Chen |
author_sort |
Kan Liu |
title |
Medical Social Media Text Classification Integrating Consumer Health Terminology |
title_short |
Medical Social Media Text Classification Integrating Consumer Health Terminology |
title_full |
Medical Social Media Text Classification Integrating Consumer Health Terminology |
title_fullStr |
Medical Social Media Text Classification Integrating Consumer Health Terminology |
title_full_unstemmed |
Medical Social Media Text Classification Integrating Consumer Health Terminology |
title_sort |
medical social media text classification integrating consumer health terminology |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
In recent years, advances in technologies, such as machine learning, natural language processing, and automated data processing, have offered potential biomedical and public health applications that use massive data sources, e.g., social media. However, current methods are underutilized for features including consumer health terminology in social media texts. In this paper, we proposed a medical social media text classification (MSMTC) algorithm that integrates consumer health terminology. Classification of text from social media on medical subjects is divided into two sub-tasks: consumer health terminology extraction and text classification. First, text characteristics based on the double channel structure are used for training, and consumer health terminology is subsequently extracted-based using an adversarial network. Then, text classification is implemented based on the extracted consumer health terminology and double channel subtraction method. This paper takes datasets containing patient descriptions from social media as an example. The experimental results show that the algorithm outperforms single channel methods or baseline models, including Convolutional Neural Networks, Long Short-Term Memory Networks, Bi-directional Long Short-Term Memory Networks, Naive Bayesian Model, and Extreme Gradient Boosting. |
topic |
Adversarial network double channel structure medical social media text classification (MSMTC) terminology |
url |
https://ieeexplore.ieee.org/document/8733799/ |
work_keys_str_mv |
AT kanliu medicalsocialmediatextclassificationintegratingconsumerhealthterminology AT luchen medicalsocialmediatextclassificationintegratingconsumerhealthterminology |
_version_ |
1724189507508502528 |