Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
In recent years, there has been an increased prevalence of adopting state-sponsored trolls by governments and political organizations to influence public opinion through disinformation campaigns on social media platforms. This phenomenon negatively affects the political process, causes distrust in t...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9239285/ |
id |
doaj-0edfb82876964caab29b98349b721c35 |
---|---|
record_format |
Article |
spelling |
doaj-0edfb82876964caab29b98349b721c352021-03-30T03:41:15ZengIEEEIEEE Access2169-35362020-01-01819513219514110.1109/ACCESS.2020.30336669239285Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on TwitterSaleh Alhazbi0https://orcid.org/0000-0001-9985-9429Department of Computer Science and Engineering, College of Engineering, Qatar University, Doha, QatarIn recent years, there has been an increased prevalence of adopting state-sponsored trolls by governments and political organizations to influence public opinion through disinformation campaigns on social media platforms. This phenomenon negatively affects the political process, causes distrust in the political systems, sows discord within societies, and hastens political polarization. Thus, there is a need to develop automated approaches to identify sponsored-troll accounts on social media in order to mitigate their impacts on the political process and to protect people against opinion manipulation. In this paper, we argue that behaviors of sponsored-troll accounts on social media are different from ordinary users' because of their extrinsic motivation, and they cannot completely hide their suspicious behaviors, therefore these accounts can be identified using machine learning approaches based solely on their behaviors on the social media platforms. We have proposed a set of behavioral features of users' activities on Twitter. Based on these features, we developed four classification models to identify political troll accounts, these models are based on decision tree, random forest, Adaboost, and gradient boost algorithms. The models were trained and evaluated on a set of Saudi trolls disclosed by Twitter in 2019, the overall classification accuracy reaches up to 94.4%. The models also are capable to identify the Russian trolls with accuracy up to 72.6% without training on this set of trolls. This indicates that although the strategies of coordinated trolls might vary from an organization to another, they are all just employees and have common behaviors that can be identified.https://ieeexplore.ieee.org/document/9239285/State-sponsored trollsdisinformationpropagandabehavioral pattern |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Saleh Alhazbi |
spellingShingle |
Saleh Alhazbi Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter IEEE Access State-sponsored trolls disinformation propaganda behavioral pattern |
author_facet |
Saleh Alhazbi |
author_sort |
Saleh Alhazbi |
title |
Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter |
title_short |
Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter |
title_full |
Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter |
title_fullStr |
Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter |
title_full_unstemmed |
Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter |
title_sort |
behavior-based machine learning approaches to identify state-sponsored trolls on twitter |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
In recent years, there has been an increased prevalence of adopting state-sponsored trolls by governments and political organizations to influence public opinion through disinformation campaigns on social media platforms. This phenomenon negatively affects the political process, causes distrust in the political systems, sows discord within societies, and hastens political polarization. Thus, there is a need to develop automated approaches to identify sponsored-troll accounts on social media in order to mitigate their impacts on the political process and to protect people against opinion manipulation. In this paper, we argue that behaviors of sponsored-troll accounts on social media are different from ordinary users' because of their extrinsic motivation, and they cannot completely hide their suspicious behaviors, therefore these accounts can be identified using machine learning approaches based solely on their behaviors on the social media platforms. We have proposed a set of behavioral features of users' activities on Twitter. Based on these features, we developed four classification models to identify political troll accounts, these models are based on decision tree, random forest, Adaboost, and gradient boost algorithms. The models were trained and evaluated on a set of Saudi trolls disclosed by Twitter in 2019, the overall classification accuracy reaches up to 94.4%. The models also are capable to identify the Russian trolls with accuracy up to 72.6% without training on this set of trolls. This indicates that although the strategies of coordinated trolls might vary from an organization to another, they are all just employees and have common behaviors that can be identified. |
topic |
State-sponsored trolls disinformation propaganda behavioral pattern |
url |
https://ieeexplore.ieee.org/document/9239285/ |
work_keys_str_mv |
AT salehalhazbi behaviorbasedmachinelearningapproachestoidentifystatesponsoredtrollsontwitter |
_version_ |
1724183033411534848 |