Twitter social bots: The 2019 Spanish general election data
The term social bots refer to software-controlled accounts that actively participate in the social platforms to influence public opinion toward desired directions. To this extent, this data descriptor presents a Twitter dataset collected from October 4th to November 11th, 2019, within the context of...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2020-10-01
|
Series: | Data in Brief |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340920309410 |
id |
doaj-d6493cd087574b05bf5409a5f987f130 |
---|---|
record_format |
Article |
spelling |
doaj-d6493cd087574b05bf5409a5f987f1302020-11-25T04:06:11ZengElsevierData in Brief2352-34092020-10-0132106047Twitter social bots: The 2019 Spanish general election dataJavier Pastor-Galindo0Mattia Zago1Pantaleone Nespoli2Sergio López Bernal3Alberto Huertas Celdrán4Manuel Gil Pérez5José A. Ruipérez-Valiente6Gregorio Martínez Pérez7Félix Gómez Mármol8Department of Information Engineering and Communications, University of Murcia, Murcia, Spain; Corresponding authors.Department of Information Engineering and Communications, University of Murcia, Murcia, Spain; Corresponding authors.Department of Information Engineering and Communications, University of Murcia, Murcia, SpainDepartment of Information Engineering and Communications, University of Murcia, Murcia, SpainTelecommunication Software & Systems Group, Waterford Institute of Technology, Cork Rd, Waterford, IrelandDepartment of Information Engineering and Communications, University of Murcia, Murcia, SpainDepartment of Information Engineering and Communications, University of Murcia, Murcia, SpainDepartment of Information Engineering and Communications, University of Murcia, Murcia, SpainDepartment of Information Engineering and Communications, University of Murcia, Murcia, SpainThe term social bots refer to software-controlled accounts that actively participate in the social platforms to influence public opinion toward desired directions. To this extent, this data descriptor presents a Twitter dataset collected from October 4th to November 11th, 2019, within the context of the Spanish general election. Starting from 46 hashtags, the collection contains almost eight hundred thousand users involved in political discussions, with a total of 5.8 million tweets. The proposed data descriptor is related to the research article available at [1]. Its main objectives are: i) to enable worldwide researchers to improve the data gathering, organization, and preprocessing phases; ii) to test machine-learning-powered proposals; and, finally, iii) to improve state-of-the-art solutions on social bots detection, analysis, and classification. Note that the data are anonymized to preserve the privacy of the users. Throughout our analysis, we enriched the collected data with meaningful features in addition to the ones provided by Twitter. In particular, the tweets collection presents the tweets’ topic mentions and keywords (in the form of political bag-of-words), and the sentiment score. The users’ collection includes one field indicating the likelihood of one account being a bot. Furthermore, for those accounts classified as bots, it also includes a score that indicates the affinity to a political party and the followers/followings list.http://www.sciencedirect.com/science/article/pii/S2352340920309410Social bots detectionSocial bots classificationMachine learningSentiment analysisSocial network analysis |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Javier Pastor-Galindo Mattia Zago Pantaleone Nespoli Sergio López Bernal Alberto Huertas Celdrán Manuel Gil Pérez José A. Ruipérez-Valiente Gregorio Martínez Pérez Félix Gómez Mármol |
spellingShingle |
Javier Pastor-Galindo Mattia Zago Pantaleone Nespoli Sergio López Bernal Alberto Huertas Celdrán Manuel Gil Pérez José A. Ruipérez-Valiente Gregorio Martínez Pérez Félix Gómez Mármol Twitter social bots: The 2019 Spanish general election data Data in Brief Social bots detection Social bots classification Machine learning Sentiment analysis Social network analysis |
author_facet |
Javier Pastor-Galindo Mattia Zago Pantaleone Nespoli Sergio López Bernal Alberto Huertas Celdrán Manuel Gil Pérez José A. Ruipérez-Valiente Gregorio Martínez Pérez Félix Gómez Mármol |
author_sort |
Javier Pastor-Galindo |
title |
Twitter social bots: The 2019 Spanish general election data |
title_short |
Twitter social bots: The 2019 Spanish general election data |
title_full |
Twitter social bots: The 2019 Spanish general election data |
title_fullStr |
Twitter social bots: The 2019 Spanish general election data |
title_full_unstemmed |
Twitter social bots: The 2019 Spanish general election data |
title_sort |
twitter social bots: the 2019 spanish general election data |
publisher |
Elsevier |
series |
Data in Brief |
issn |
2352-3409 |
publishDate |
2020-10-01 |
description |
The term social bots refer to software-controlled accounts that actively participate in the social platforms to influence public opinion toward desired directions. To this extent, this data descriptor presents a Twitter dataset collected from October 4th to November 11th, 2019, within the context of the Spanish general election. Starting from 46 hashtags, the collection contains almost eight hundred thousand users involved in political discussions, with a total of 5.8 million tweets. The proposed data descriptor is related to the research article available at [1]. Its main objectives are: i) to enable worldwide researchers to improve the data gathering, organization, and preprocessing phases; ii) to test machine-learning-powered proposals; and, finally, iii) to improve state-of-the-art solutions on social bots detection, analysis, and classification. Note that the data are anonymized to preserve the privacy of the users. Throughout our analysis, we enriched the collected data with meaningful features in addition to the ones provided by Twitter. In particular, the tweets collection presents the tweets’ topic mentions and keywords (in the form of political bag-of-words), and the sentiment score. The users’ collection includes one field indicating the likelihood of one account being a bot. Furthermore, for those accounts classified as bots, it also includes a score that indicates the affinity to a political party and the followers/followings list. |
topic |
Social bots detection Social bots classification Machine learning Sentiment analysis Social network analysis |
url |
http://www.sciencedirect.com/science/article/pii/S2352340920309410 |
work_keys_str_mv |
AT javierpastorgalindo twittersocialbotsthe2019spanishgeneralelectiondata AT mattiazago twittersocialbotsthe2019spanishgeneralelectiondata AT pantaleonenespoli twittersocialbotsthe2019spanishgeneralelectiondata AT sergiolopezbernal twittersocialbotsthe2019spanishgeneralelectiondata AT albertohuertasceldran twittersocialbotsthe2019spanishgeneralelectiondata AT manuelgilperez twittersocialbotsthe2019spanishgeneralelectiondata AT josearuiperezvaliente twittersocialbotsthe2019spanishgeneralelectiondata AT gregoriomartinezperez twittersocialbotsthe2019spanishgeneralelectiondata AT felixgomezmarmol twittersocialbotsthe2019spanishgeneralelectiondata |
_version_ |
1724432050268667904 |