Stance Detection of Microblog Text Based on Two-Channel CNN-GRU Fusion Network
In recent years, stance detection has become an important topic in the field of natural language processing. In earlier work, researchers have used feature engineering for stance detection but they need to define and extract appropriate features according to the particular application. This leads to...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8851136/ |
id |
doaj-91d792e6ad3740e9863a2b1224638d10 |
---|---|
record_format |
Article |
spelling |
doaj-91d792e6ad3740e9863a2b1224638d102021-03-30T00:35:00ZengIEEEIEEE Access2169-35362019-01-01714594414595210.1109/ACCESS.2019.29441368851136Stance Detection of Microblog Text Based on Two-Channel CNN-GRU Fusion NetworkWenfa Li0Yilong Xu1Gongming Wang2https://orcid.org/0000-0002-0466-0888Robotics College, Beijing Union University, Beijing, ChinaSmart City College, Beijing Union University, Beijing, ChinaInspur Software Group Company Ltd., Jinan, ChinaIn recent years, stance detection has become an important topic in the field of natural language processing. In earlier work, researchers have used feature engineering for stance detection but they need to define and extract appropriate features according to the particular application. This leads to poor generalization and a complex modeling process. Other researchers have applied deep learning methods. However, the popular convolutional neural network (CNN) method has the problem of information loss and a single-size CNN filter cannot accurately extract features that have different lengths from text, and so cannot deal with the diverse nature of features. In order to address these problems, we propose a two-channel CNN-GRU fusion network. First, a convolution layer with two filters with different window sizes is used to extract local features within the topic content and text content. Then, a gated recurrent unit (GRU) network is used to extract their timing characteristics. After that, the intermediate features are spliced and input to a classifier to complete the stance detection. Our method is validated using data from NLPCC 2016. The experimental results show that ACC and average F1 score of this method are 13.1% and 15.6% better than SVM method, 6.2% and 11.6% better than CNN method, 5.6% and 3.3% better than GRU method, and 1.1% and 2.2% better compared with hybrid model proposed by Nanyu, respectively, which is used as a baseline with no increase in run-time, and achieves the same accuracy with less run-time than another baseline of a semantic attention-based model proposed by Zhou. In addition, our method allows better classification than the single channel model. Finally, we find that the operation time of a multi-channel CNN-GRU increases gradually with increasing number of channels, but the classification accuracy does not improve, so a two-channel CNN-GRU is the most appropriate choice.https://ieeexplore.ieee.org/document/8851136/Stance detectionnatural language processingdeep learningCNNGRU |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Wenfa Li Yilong Xu Gongming Wang |
spellingShingle |
Wenfa Li Yilong Xu Gongming Wang Stance Detection of Microblog Text Based on Two-Channel CNN-GRU Fusion Network IEEE Access Stance detection natural language processing deep learning CNN GRU |
author_facet |
Wenfa Li Yilong Xu Gongming Wang |
author_sort |
Wenfa Li |
title |
Stance Detection of Microblog Text Based on Two-Channel CNN-GRU Fusion Network |
title_short |
Stance Detection of Microblog Text Based on Two-Channel CNN-GRU Fusion Network |
title_full |
Stance Detection of Microblog Text Based on Two-Channel CNN-GRU Fusion Network |
title_fullStr |
Stance Detection of Microblog Text Based on Two-Channel CNN-GRU Fusion Network |
title_full_unstemmed |
Stance Detection of Microblog Text Based on Two-Channel CNN-GRU Fusion Network |
title_sort |
stance detection of microblog text based on two-channel cnn-gru fusion network |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
In recent years, stance detection has become an important topic in the field of natural language processing. In earlier work, researchers have used feature engineering for stance detection but they need to define and extract appropriate features according to the particular application. This leads to poor generalization and a complex modeling process. Other researchers have applied deep learning methods. However, the popular convolutional neural network (CNN) method has the problem of information loss and a single-size CNN filter cannot accurately extract features that have different lengths from text, and so cannot deal with the diverse nature of features. In order to address these problems, we propose a two-channel CNN-GRU fusion network. First, a convolution layer with two filters with different window sizes is used to extract local features within the topic content and text content. Then, a gated recurrent unit (GRU) network is used to extract their timing characteristics. After that, the intermediate features are spliced and input to a classifier to complete the stance detection. Our method is validated using data from NLPCC 2016. The experimental results show that ACC and average F1 score of this method are 13.1% and 15.6% better than SVM method, 6.2% and 11.6% better than CNN method, 5.6% and 3.3% better than GRU method, and 1.1% and 2.2% better compared with hybrid model proposed by Nanyu, respectively, which is used as a baseline with no increase in run-time, and achieves the same accuracy with less run-time than another baseline of a semantic attention-based model proposed by Zhou. In addition, our method allows better classification than the single channel model. Finally, we find that the operation time of a multi-channel CNN-GRU increases gradually with increasing number of channels, but the classification accuracy does not improve, so a two-channel CNN-GRU is the most appropriate choice. |
topic |
Stance detection natural language processing deep learning CNN GRU |
url |
https://ieeexplore.ieee.org/document/8851136/ |
work_keys_str_mv |
AT wenfali stancedetectionofmicroblogtextbasedontwochannelcnngrufusionnetwork AT yilongxu stancedetectionofmicroblogtextbasedontwochannelcnngrufusionnetwork AT gongmingwang stancedetectionofmicroblogtextbasedontwochannelcnngrufusionnetwork |
_version_ |
1724188140693880832 |