Twitter social bots: The 2019 Spanish general election data

The term social bots refer to software-controlled accounts that actively participate in the social platforms to influence public opinion toward desired directions. To this extent, this data descriptor presents a Twitter dataset collected from October 4th to November 11th, 2019, within the context of...

Full description

Bibliographic Details
Main Authors: Javier Pastor-Galindo, Mattia Zago, Pantaleone Nespoli, Sergio López Bernal, Alberto Huertas Celdrán, Manuel Gil Pérez, José A. Ruipérez-Valiente, Gregorio Martínez Pérez, Félix Gómez Mármol
Format: Article
Language:English
Published: Elsevier 2020-10-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352340920309410
id doaj-d6493cd087574b05bf5409a5f987f130
record_format Article
spelling doaj-d6493cd087574b05bf5409a5f987f1302020-11-25T04:06:11ZengElsevierData in Brief2352-34092020-10-0132106047Twitter social bots: The 2019 Spanish general election dataJavier Pastor-Galindo0Mattia Zago1Pantaleone Nespoli2Sergio López Bernal3Alberto Huertas Celdrán4Manuel Gil Pérez5José A. Ruipérez-Valiente6Gregorio Martínez Pérez7Félix Gómez Mármol8Department of Information Engineering and Communications, University of Murcia, Murcia, Spain; Corresponding authors.Department of Information Engineering and Communications, University of Murcia, Murcia, Spain; Corresponding authors.Department of Information Engineering and Communications, University of Murcia, Murcia, SpainDepartment of Information Engineering and Communications, University of Murcia, Murcia, SpainTelecommunication Software & Systems Group, Waterford Institute of Technology, Cork Rd, Waterford, IrelandDepartment of Information Engineering and Communications, University of Murcia, Murcia, SpainDepartment of Information Engineering and Communications, University of Murcia, Murcia, SpainDepartment of Information Engineering and Communications, University of Murcia, Murcia, SpainDepartment of Information Engineering and Communications, University of Murcia, Murcia, SpainThe term social bots refer to software-controlled accounts that actively participate in the social platforms to influence public opinion toward desired directions. To this extent, this data descriptor presents a Twitter dataset collected from October 4th to November 11th, 2019, within the context of the Spanish general election. Starting from 46 hashtags, the collection contains almost eight hundred thousand users involved in political discussions, with a total of 5.8 million tweets. The proposed data descriptor is related to the research article available at [1]. Its main objectives are: i) to enable worldwide researchers to improve the data gathering, organization, and preprocessing phases; ii) to test machine-learning-powered proposals; and, finally, iii) to improve state-of-the-art solutions on social bots detection, analysis, and classification. Note that the data are anonymized to preserve the privacy of the users. Throughout our analysis, we enriched the collected data with meaningful features in addition to the ones provided by Twitter. In particular, the tweets collection presents the tweets’ topic mentions and keywords (in the form of political bag-of-words), and the sentiment score. The users’ collection includes one field indicating the likelihood of one account being a bot. Furthermore, for those accounts classified as bots, it also includes a score that indicates the affinity to a political party and the followers/followings list.http://www.sciencedirect.com/science/article/pii/S2352340920309410Social bots detectionSocial bots classificationMachine learningSentiment analysisSocial network analysis
collection DOAJ
language English
format Article
sources DOAJ
author Javier Pastor-Galindo
Mattia Zago
Pantaleone Nespoli
Sergio López Bernal
Alberto Huertas Celdrán
Manuel Gil Pérez
José A. Ruipérez-Valiente
Gregorio Martínez Pérez
Félix Gómez Mármol
spellingShingle Javier Pastor-Galindo
Mattia Zago
Pantaleone Nespoli
Sergio López Bernal
Alberto Huertas Celdrán
Manuel Gil Pérez
José A. Ruipérez-Valiente
Gregorio Martínez Pérez
Félix Gómez Mármol
Twitter social bots: The 2019 Spanish general election data
Data in Brief
Social bots detection
Social bots classification
Machine learning
Sentiment analysis
Social network analysis
author_facet Javier Pastor-Galindo
Mattia Zago
Pantaleone Nespoli
Sergio López Bernal
Alberto Huertas Celdrán
Manuel Gil Pérez
José A. Ruipérez-Valiente
Gregorio Martínez Pérez
Félix Gómez Mármol
author_sort Javier Pastor-Galindo
title Twitter social bots: The 2019 Spanish general election data
title_short Twitter social bots: The 2019 Spanish general election data
title_full Twitter social bots: The 2019 Spanish general election data
title_fullStr Twitter social bots: The 2019 Spanish general election data
title_full_unstemmed Twitter social bots: The 2019 Spanish general election data
title_sort twitter social bots: the 2019 spanish general election data
publisher Elsevier
series Data in Brief
issn 2352-3409
publishDate 2020-10-01
description The term social bots refer to software-controlled accounts that actively participate in the social platforms to influence public opinion toward desired directions. To this extent, this data descriptor presents a Twitter dataset collected from October 4th to November 11th, 2019, within the context of the Spanish general election. Starting from 46 hashtags, the collection contains almost eight hundred thousand users involved in political discussions, with a total of 5.8 million tweets. The proposed data descriptor is related to the research article available at [1]. Its main objectives are: i) to enable worldwide researchers to improve the data gathering, organization, and preprocessing phases; ii) to test machine-learning-powered proposals; and, finally, iii) to improve state-of-the-art solutions on social bots detection, analysis, and classification. Note that the data are anonymized to preserve the privacy of the users. Throughout our analysis, we enriched the collected data with meaningful features in addition to the ones provided by Twitter. In particular, the tweets collection presents the tweets’ topic mentions and keywords (in the form of political bag-of-words), and the sentiment score. The users’ collection includes one field indicating the likelihood of one account being a bot. Furthermore, for those accounts classified as bots, it also includes a score that indicates the affinity to a political party and the followers/followings list.
topic Social bots detection
Social bots classification
Machine learning
Sentiment analysis
Social network analysis
url http://www.sciencedirect.com/science/article/pii/S2352340920309410
work_keys_str_mv AT javierpastorgalindo twittersocialbotsthe2019spanishgeneralelectiondata
AT mattiazago twittersocialbotsthe2019spanishgeneralelectiondata
AT pantaleonenespoli twittersocialbotsthe2019spanishgeneralelectiondata
AT sergiolopezbernal twittersocialbotsthe2019spanishgeneralelectiondata
AT albertohuertasceldran twittersocialbotsthe2019spanishgeneralelectiondata
AT manuelgilperez twittersocialbotsthe2019spanishgeneralelectiondata
AT josearuiperezvaliente twittersocialbotsthe2019spanishgeneralelectiondata
AT gregoriomartinezperez twittersocialbotsthe2019spanishgeneralelectiondata
AT felixgomezmarmol twittersocialbotsthe2019spanishgeneralelectiondata
_version_ 1724432050268667904