A Novel Hybrid Model for Cantonese Rumor Detection on Twitter

The development of information technology and mobile Internet has spawned the prosperity of online social networks. As the world’s largest microblogging platform, Twitter is popular among people all over the world. However, as the number of users on Twitter increases, rumors have become a serious pr...

Full description

Bibliographic Details
Main Authors: Xinyu Chen, Liang Ke, Zhipeng Lu, Hanjian Su, Haizhou Wang
Format: Article
Language:English
Published: MDPI AG 2020-10-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/10/20/7093
id doaj-09bcddddf81f451d852bb8059ab4b041
record_format Article
spelling doaj-09bcddddf81f451d852bb8059ab4b0412020-11-25T03:55:41ZengMDPI AGApplied Sciences2076-34172020-10-01107093709310.3390/app10207093A Novel Hybrid Model for Cantonese Rumor Detection on TwitterXinyu Chen0Liang Ke1Zhipeng Lu2Hanjian Su3Haizhou Wang4College of Cybersecurity, Sichuan University, Chengdu 610064, ChinaCollege of Cybersecurity, Sichuan University, Chengdu 610064, ChinaCollege of Cybersecurity, Sichuan University, Chengdu 610064, ChinaCollege of Cybersecurity, Sichuan University, Chengdu 610064, ChinaCollege of Cybersecurity, Sichuan University, Chengdu 610064, ChinaThe development of information technology and mobile Internet has spawned the prosperity of online social networks. As the world’s largest microblogging platform, Twitter is popular among people all over the world. However, as the number of users on Twitter increases, rumors have become a serious problem. Therefore, rumor detection is necessary since it can prevent unverified information from causing public panic and disrupting social order. Cantonese is a widely used language in China. However, to the best of our knowledge, little research has been done on Cantonese rumor detection. In this paper, we propose a novel hybrid model XGA (namely XLNet-based Bidirectional Gated Recurrent Unit (BiGRU) network with Attention mechanism) for Cantonese rumor detection on Twitter. Specifically, we take advantage of both semantic and sentiment features for detection. First of all, XLNet is employed to produce text-based and sentiment-based embeddings at the character level. Then we perform joint learning of character and word embeddings to obtain the words’ external contexts and internal structures. In addition, we leverage BiGRU and the attention mechanism to obtain important semantic features and use the Cantonese rumor dataset we constructed to train our proposed model. The experimental results show that the XGA model outperforms the other popular models in Cantonese rumor detection. The research in this paper provides methods and ideas for future work in Cantonese rumor detection on other social networking platforms.https://www.mdpi.com/2076-3417/10/20/7093online social networksrumor detectionCantoneseXGA model
collection DOAJ
language English
format Article
sources DOAJ
author Xinyu Chen
Liang Ke
Zhipeng Lu
Hanjian Su
Haizhou Wang
spellingShingle Xinyu Chen
Liang Ke
Zhipeng Lu
Hanjian Su
Haizhou Wang
A Novel Hybrid Model for Cantonese Rumor Detection on Twitter
Applied Sciences
online social networks
rumor detection
Cantonese
XGA model
author_facet Xinyu Chen
Liang Ke
Zhipeng Lu
Hanjian Su
Haizhou Wang
author_sort Xinyu Chen
title A Novel Hybrid Model for Cantonese Rumor Detection on Twitter
title_short A Novel Hybrid Model for Cantonese Rumor Detection on Twitter
title_full A Novel Hybrid Model for Cantonese Rumor Detection on Twitter
title_fullStr A Novel Hybrid Model for Cantonese Rumor Detection on Twitter
title_full_unstemmed A Novel Hybrid Model for Cantonese Rumor Detection on Twitter
title_sort novel hybrid model for cantonese rumor detection on twitter
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2020-10-01
description The development of information technology and mobile Internet has spawned the prosperity of online social networks. As the world’s largest microblogging platform, Twitter is popular among people all over the world. However, as the number of users on Twitter increases, rumors have become a serious problem. Therefore, rumor detection is necessary since it can prevent unverified information from causing public panic and disrupting social order. Cantonese is a widely used language in China. However, to the best of our knowledge, little research has been done on Cantonese rumor detection. In this paper, we propose a novel hybrid model XGA (namely XLNet-based Bidirectional Gated Recurrent Unit (BiGRU) network with Attention mechanism) for Cantonese rumor detection on Twitter. Specifically, we take advantage of both semantic and sentiment features for detection. First of all, XLNet is employed to produce text-based and sentiment-based embeddings at the character level. Then we perform joint learning of character and word embeddings to obtain the words’ external contexts and internal structures. In addition, we leverage BiGRU and the attention mechanism to obtain important semantic features and use the Cantonese rumor dataset we constructed to train our proposed model. The experimental results show that the XGA model outperforms the other popular models in Cantonese rumor detection. The research in this paper provides methods and ideas for future work in Cantonese rumor detection on other social networking platforms.
topic online social networks
rumor detection
Cantonese
XGA model
url https://www.mdpi.com/2076-3417/10/20/7093
work_keys_str_mv AT xinyuchen anovelhybridmodelforcantoneserumordetectionontwitter
AT liangke anovelhybridmodelforcantoneserumordetectionontwitter
AT zhipenglu anovelhybridmodelforcantoneserumordetectionontwitter
AT hanjiansu anovelhybridmodelforcantoneserumordetectionontwitter
AT haizhouwang anovelhybridmodelforcantoneserumordetectionontwitter
AT xinyuchen novelhybridmodelforcantoneserumordetectionontwitter
AT liangke novelhybridmodelforcantoneserumordetectionontwitter
AT zhipenglu novelhybridmodelforcantoneserumordetectionontwitter
AT hanjiansu novelhybridmodelforcantoneserumordetectionontwitter
AT haizhouwang novelhybridmodelforcantoneserumordetectionontwitter
_version_ 1724468729135235072