Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter

In recent years, there has been an increased prevalence of adopting state-sponsored trolls by governments and political organizations to influence public opinion through disinformation campaigns on social media platforms. This phenomenon negatively affects the political process, causes distrust in t...

Full description

Bibliographic Details
Main Author: Saleh Alhazbi
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9239285/
id doaj-0edfb82876964caab29b98349b721c35
record_format Article
spelling doaj-0edfb82876964caab29b98349b721c352021-03-30T03:41:15ZengIEEEIEEE Access2169-35362020-01-01819513219514110.1109/ACCESS.2020.30336669239285Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on TwitterSaleh Alhazbi0https://orcid.org/0000-0001-9985-9429Department of Computer Science and Engineering, College of Engineering, Qatar University, Doha, QatarIn recent years, there has been an increased prevalence of adopting state-sponsored trolls by governments and political organizations to influence public opinion through disinformation campaigns on social media platforms. This phenomenon negatively affects the political process, causes distrust in the political systems, sows discord within societies, and hastens political polarization. Thus, there is a need to develop automated approaches to identify sponsored-troll accounts on social media in order to mitigate their impacts on the political process and to protect people against opinion manipulation. In this paper, we argue that behaviors of sponsored-troll accounts on social media are different from ordinary users' because of their extrinsic motivation, and they cannot completely hide their suspicious behaviors, therefore these accounts can be identified using machine learning approaches based solely on their behaviors on the social media platforms. We have proposed a set of behavioral features of users' activities on Twitter. Based on these features, we developed four classification models to identify political troll accounts, these models are based on decision tree, random forest, Adaboost, and gradient boost algorithms. The models were trained and evaluated on a set of Saudi trolls disclosed by Twitter in 2019, the overall classification accuracy reaches up to 94.4%. The models also are capable to identify the Russian trolls with accuracy up to 72.6% without training on this set of trolls. This indicates that although the strategies of coordinated trolls might vary from an organization to another, they are all just employees and have common behaviors that can be identified.https://ieeexplore.ieee.org/document/9239285/State-sponsored trollsdisinformationpropagandabehavioral pattern
collection DOAJ
language English
format Article
sources DOAJ
author Saleh Alhazbi
spellingShingle Saleh Alhazbi
Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
IEEE Access
State-sponsored trolls
disinformation
propaganda
behavioral pattern
author_facet Saleh Alhazbi
author_sort Saleh Alhazbi
title Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
title_short Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
title_full Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
title_fullStr Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
title_full_unstemmed Behavior-Based Machine Learning Approaches to Identify State-Sponsored Trolls on Twitter
title_sort behavior-based machine learning approaches to identify state-sponsored trolls on twitter
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description In recent years, there has been an increased prevalence of adopting state-sponsored trolls by governments and political organizations to influence public opinion through disinformation campaigns on social media platforms. This phenomenon negatively affects the political process, causes distrust in the political systems, sows discord within societies, and hastens political polarization. Thus, there is a need to develop automated approaches to identify sponsored-troll accounts on social media in order to mitigate their impacts on the political process and to protect people against opinion manipulation. In this paper, we argue that behaviors of sponsored-troll accounts on social media are different from ordinary users' because of their extrinsic motivation, and they cannot completely hide their suspicious behaviors, therefore these accounts can be identified using machine learning approaches based solely on their behaviors on the social media platforms. We have proposed a set of behavioral features of users' activities on Twitter. Based on these features, we developed four classification models to identify political troll accounts, these models are based on decision tree, random forest, Adaboost, and gradient boost algorithms. The models were trained and evaluated on a set of Saudi trolls disclosed by Twitter in 2019, the overall classification accuracy reaches up to 94.4%. The models also are capable to identify the Russian trolls with accuracy up to 72.6% without training on this set of trolls. This indicates that although the strategies of coordinated trolls might vary from an organization to another, they are all just employees and have common behaviors that can be identified.
topic State-sponsored trolls
disinformation
propaganda
behavioral pattern
url https://ieeexplore.ieee.org/document/9239285/
work_keys_str_mv AT salehalhazbi behaviorbasedmachinelearningapproachestoidentifystatesponsoredtrollsontwitter
_version_ 1724183033411534848